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Introduction 


The SuperH RISC engine family incorporates a RISC (Reduced Instruction Set Computer) type 
CPU. A basic instruction can be executed in one. clock cycle, realizing high performance 
operation. A built-in multiplier can execute multiplication and addition as quickly as DSP. 


The SuperH RISC engine has SH-I CPU, SH-2 CPU, and SH-3 CPU cores. 


The SH-1 CPU, SH-2 CPU and SH-3 CPU have an instruction system with upward compatibility 
at the binary level. 


SH-3 CPU MMU support 


SH-2 CPU Operation instruction enhancement] 68 instruction 





SH-1 CPU 62 instructions 
56 basic instructions 


Refer to the programming manual for the method of executing the instructions or for the 
architecture. You can also refer to this programming manual to know the operation of the pipe 
line, which is one of the features of the RISC CPU. 


This programming manual describes in detail the instructions for the SH-1 CPU and SH-2 CPU 
instructions. For the SH-3 CPU, refer to the separate volume of SH-3 CPU programming manual. 


For the hardware, refer to individual hardware manuals for each unit. 


Organization of This Manual 


Table 1 describes how this manual is organized. Table 2 lists the relationships between the items 
and the sections listed within this manual that cover those items. 


Table 1 


Category 
Introduction 
Architecture (1) 


Introduction to 
instructions 


Detailed information 
on instructions 


Architecture (2) 


Instruction code 


Manual Organization 


Section Title 
1. Features 


2. Register 
Configuration 


3. Data Formats 


4. Instruction 
Features 


5. Instruction Sets 


6. Instruction 
Descriptions 


7. Pipeline Operation 


Appendixes: 
Instruction Code 


Contents 
CPU features 


Types and configuration of general registers, 
control registers and system registers 


Data formats for registers and memory 


Instruction features, addressing modes, and 
instruction formats 


Summary of instructions by Caley and list in 
alphabetic order 


Operation of each instruction in alphabetical order 


Pipeline flow, and pipeline flows with operation for 
each instruction 


Operation code map 


Table 2 Subjects and Corresponding Sections 


Category 


Introduction and 
features 


Architecture 


Introduction to 
instructions 


List of 
instructions 


Detailed 
information on 
instructions 


Topic 
CPU features 
Instruction features 


Pipelines 


Register configuration 
Data formats 

Pipeline operation 
Instruction features 
Addressing modes 
Instruction formats 


Instruction sets 


Instruction code 


Detailed information on instruction 
operation 


Number of instruction execution states 


Section Title 
1. Features | 
4.1 RISC-Type Instruction Set 


7.1 Basic Configuration of 
Pipelines 


7.2 Slot and Pipeline Flow 
2. Register Configuration 
3. Data Formats 

7. Pipeline Operation 

4. Instruction Features 
4.2 Addressing Modes 
4.3 Instruction Formats 


5.1 Instruction Set by 
Classification 
5.2 Instruction Set in 
Alphabetical Order 
Appendix A.1_ Instruction Set by 
Addressing Mode 
Appendix A.2_ Instruction Set by 
Instruction Format 


Appendix A.3 Instruction Set in 
Order by 
Instruction Code 


Appendix A.4 Operation Code 
Map 


6. Instruction Description 


7.7 \nstruction Pipeline 
Operations 


7.3 Number of Instruction 
Execution States 


Functions Listed by CPU Type 


This manual is common for both the SH-1 and SH-2 CPU. However, not all CPUs can use all the 
instructions and functions. Table 3 lists the usable functions by CPU type. | | 


Table3 —‘ Functions by CPU Type 


Item 


Instructions 


BF/S 

BRAF 

BSRF 

BT/S 

DMULS.L 
DMULU.L 

DT 

MAC.L 

MAC.W*? (MAC)*2 


~ MULLL 


States for multiplication 


operation 


All others 
16 x 16 > 32 


(MULS.W, MULU.W)*2 
32 x 32 — 32 (MUL.L) 


32 x 32 — 64 


SH-1 CPU 

No 

No 

No 

No 

No 

No 

No 

No 

16x 16+ 42— 


42 


No 
Yes 


Executed in 1-3* 
states 


No 
No 


SH-2 CPU 
Yes 
Yes 


Yes 


Yes 


Yes 
Yes 
Yes 
Yes 
16 x 16 + 64 > 64 


Yes 
Yes 
Executed in 1-3**states 


Executed in 2-4 *Sstates 
Executed in 2-4 *3states 


(DMULS.L, DMULU.L) 


States for multiply and 


accumulate 


Notes: 1. 


operation 


MAC.W works 


number in contention with preceding/following instructions). 


16 x 16 + 42 — 42 
(SH-1, MAC.W) 


16 x 16 + 64 — 64 
(SH-2, MAC.W) 


32 x 32 + 64 > 64 


_(MAC.L) 


Executed in 
3/(2)*5 states 


No 


No © 


differently on different LSls. 
MAC and MAC.W are the same. MULS is also the same as MULS.W and MULU the 
same as MULU.W. 


The normal minimum number of execution cycles (The number in parentheses in the 


No 
Executed in states 3/(2)*% 


Executed in 2—4 states 
3/(2~4)*8 
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Section 1 Features 


The SH-1 and SH-2 CPU have RISC-type instruction sets. Basic instructions are executed in one 
clock cycle, which dramatically improves instruction execution speed. The CPU also has an 
internal 32-bit architecture for enhanced data processing ability. Table 1.1 lists the SH-1 and SH-2 


CPU features. 


Table 1.1 SH-1 and SH-2 CPU Features 


Item Feature 


Architecture ° 


General-register machine 


Instruction set 


Instruction execution time = 


Address space 


On-chip multiplier 


(SH-1 CPU) 
On-chip multiplier ° 
(SH-2 CPU) 
Pipeline . 
Processing states . 
Power-down states ° 


Original Hitachi architecture 

32-bit internal data paths 

Sixteen 32-bit general registers 

Three 32-bit control registers 

Four 32-bit system registers 

Instruction length: 16-bit fixed length for improved code efficiency 


Load-store architecture (basic arithmetic and logic operations are 
executed between registers) 


Delayed branch system used for reduced pipeline disruption 
Instruction set optimized for C language 

One instruction/cycle for basic instructions 

Architecture makes 4 Gbytes available 


Multiplication operations (16 bits x 16 bits — 32 bits) executed in 1 
to 3 cycles, and multiplication/accumulation operations (16 bits x 16 
bits + 42 bits — 42 bits) executed in 3/(2)* cycles 


Multiplication operations executed in 1 to 2 cycles (16 bits x 16 bits 
— 32 bits) or 2 to 4 cycles (82 bits x 32 bits — 64 bits), and 
multiplication/accumulation operations executed in 3/(2)*cycles (16 
bits x 16 bits + 64 bits > 64 bits) or 3/(2 to 4)* cycles (82 bits x 32 
bits + 64 bits > 64 bits) 


Five-stage pipeline 

Reset state 

Exception processing state 
Program execution state 
Power-down state 

Bus release state 

Sleep mode 

Standby mode 


Note: The normal minimum number of execution cycles (The number in parentheses in the 
mumber in contention with preceding/following instructions). 


Section 2 Register Configuration 


The register set consists of sixteen 32-bit ene registers, three 32-bit control registers and four 
32-bit system registers. 


2.1. General Registers 


There are 16 general registers (Rn) numbered RO-R15, which are 32 bits in length (figure 2.1). 
General registers are used for data processing and address calculation. RO is also used as an index 
register. Several instructions use RO as a fixed source or destination register. R15 is used as the 
hardware stack pointer (SP). Saving and recovering the status register (SR) and program counter 
(PC) in exception processing is accomplished by referencing the stack using R15. 


1. RO functions as an index register in the 
indirect indexed register addressing 
_ mode and indirect indexed GBR 
addressing mode. In some instructions, 
RO functions as a fixed source register 
or destination register. 


=r = (hardware stack pointer) *2| 2. R15 functions as a hardware stack 


pointer (SP) during exception 
processing. 





Figure 2.1 General Registers 


2.2 | Control Registers 


The 32-bit control registers consist of the 32-bit status register (SR), global base register (GBR), 
and vector base register (VBR) (figure 2.2). The status register indicates processing states. The 
global base register functions as a base address for the indirect GBR addressing mode to transfer 


data to the registers of on-chip peripheral modules. The vector base register functions as the base 
address of the exception processing vector area (including interrupts). 


98 76543210 
MQI3 12 11 10 -- ST | SR: Status register 


T bit: The MOVT, CMP/cond, TAS, TST, 
BT (BT/S), BF (BF/S), SETT, and CLRT 
instructions use the T bit to indicate 
true (1) or false (0). The ADDV/C, 
SUBV/C, DIVOU/S, DIV1, NEGC, 
SHAR/L, SHLR/L, ROTR/L, and 
ROTCRIL instructions also use bit T 
to indicate carry/borrow or overflow/ 
underflow 

S bit: Used by the multiply/accumulate 
instruction. 


Reserved bits: Always reads as 0, and should 
always be written with 0. 


Bits I3—l0: Interrupt mask bits. 


M and Q bits: Used by the DIVOU/S and 
DIV1 instructions. 


Global base register (GBR): 

0 Indicates the base address of the indirect 
GBR addressing mode. The indirect GBR 
addressing mode is used in data transfer 
for on-chip peripheral module register 
areas and in logic operations. 


O Vector base register (VBR): 
Indicates the base address of the exception 
processing vector area. 





Figure 2.2 Control Registers 


2.3 System Registers 


The system registers consist of four 32-bit registers: high and low multiply and accumulate registers 
(MACH and MACL), the procedure register (PR), and the program counter (PC) (figure 2.3). The 
multiply and accumulate registers store the results of multiply and accumulate operations. The 
procedure register stores the return address from the subroutine procedure. The program counter 
stores program addresses to control the flow of the processing. 


Multiply and accumulate (MAC) 
registers high and low (MACHIL): 
Store the results of multiply and 
accumulate operations. In the — 
SH-1 CPU, MACH is sign-extended 
to 32 bits when read because only 
the lowest 10 bits are valid. In the 
SH-2 CPU, all 32 bits of MACH are 
valid. 


Procedure register (PR): Stores a 
return address from a subroutine 
procedure. 


Program counter (PC): Indicates the 
fourth byte (second instruction) after 
the current instruction. 





Figure 2.3 System Registers 


2.4 Initial Values of Registers 


Table 2.1 lists the values of the registers after reset. 


Table 2.1 Initial Values of Registers 


Classification Register Initial Value 
General register RO-R14 Undefined 
R15 (SP) Value of the stack pointer in the vector address table 
Control register SR Bits 18-0 are 1111 (H'F), reserved bits are 0, and 
other bits are undefined 
GBR Undefined 
VBR H'00000000 
System register MACH, MACL,PR ~ Undefined 
PC Value of the program counter in the vector address 
table 


Section 3. Data Formats 


3.1 Data Format in Registers 


Register operands are always longwords (32 bits) (figure 3.1). When the memory operand is only 
a byte (8 bits) or a word (16 bits), it is sign-extended into a longword when loaded into a register. 


31 0 


Figure 3.1 Longword Operand 





3.2. Data Format in Memory 


Memory data formats are classified into bytes, words, and longwords. Byte data can be accessed 
from any address, but an address error will occur if you try to access word data starting from an 
address other than 2n or longword data starting from an address other than 4n. In such cases, the 
data accessed cannot be guaranteed (figure 3.2). The hardware stack area, which is referred to by 
the hardware stack pointer (SP, R15), uses only longword data starting from address 4n because 
this area holds the program counter and status register. See the SH Hardware Manual for more 
information on address errors. 


Address m+ 1 Address m +3 


Address m_ | Address m+ 2 
23 15y¥ 7490 


Address 2n-» Word 
Address 4ne|—Longword 


Big endian 





Figure 3.2 Byte, Word, and Longword Alignment 


SH7604 has a function that allows access of CS2 space (area 2) in little endian format, which 
enables memory to be shared with processors that access memory in little endian format (figure 
3.3). Byte data is arranged differently for little endian and the usual big endian. 


Addressm+2 Address m 


Address m + 3} Address m + 1 
23 154 740 


Little endian* 


Note : Only CS2 space of SH7604 can be set. - 





Figure 3.3 Byte, Word, and Longword Alignment in little endian format (SH7604 only) 


3.3 Immediate Data Format 


Byte immediate data is located in an instruction code. Immediate data accessed by the MOV, 
ADD, and CMP/EQ instructions is sign-extended and calculated with registers and longword data. 
Immediate data accessed by the TST, AND, OR, and XOR instructions is zero-extended and 
calculated with longword data. Consequently, AND instructions with immediate data always clear 
the upper 24 bits of the destination register. | | 


Word or longword immediate data is not located in the instruction code. Rather, it is stored ina 
memory table. The memory table is accessed by an immediate data transfer instruction (MOV) 
using the PC relative addressing mode with displacement. Specific examples are given in section 
4.1.8, Immediate Data. 


Section 4 Instruction Features 


4.1 RISC-Type Instruction Set 


All instructions are RISC type. Their features are detailed in this section. 


4.1.1  16-Bit Fixed Length 


All instructions are 16 bits long, increasing program coding efficiency. 


4.1.2 One Instruction/Cycle 


Basic instructions can be executed in one cycle using the pipeline system. Instructions are 
executed in 5O ns at 20 MHz, in 35 ns at 28.7MHz. 


4.1.3 Data Length 


Longword is the standard data length for all operations. Memory can be accessed in bytes, words, 
or longwords. Byte or word data accessed from memory is sign-extended and calculated with 
longword data (table 4.1). Immediate data is sign-extended for arithmetic operations or zero- 
extended for logic operations. It also is calculated with longword data. 


Table 4.1 Sign Extension of Word Data 


SH-1/SH-2 CPU Description Example for Other CPU 
MOV.W @(disp,PC),R1 Data is sign-extended to 32 ADD .W #H'1234,RO0 
ADD R1, RO bits, and R1 becomes 
H'00001234. It is next 
De ee ' operated upon by an ADD 
.DATA.W H'1234 instruction. 


Note: The address of the immediate data is accessed by @(disp, PC). 


4.1.4 Load-Store Architecture 


Basic operations are executed between registers. For operations that involve memory access, data 
is loaded to the registers and executed (load-store architecture). Instructions such as AND that 
manipulate bits, however, are executed directly in memory. 


4.1.55 Delayed Branch Instructions 


Unconditional branch instructions are delayed. Pipeline disruption during branching is reduced by 
first executing the instruction that follows the branch instruction, and then branching (table 4.2). 
With delayed branching, branching occurs after execution of the slot instruction. However, 
instructions such as register changes etc. are executed in the order of delayed branch instruction, 
then delay slot instruction. For example, even if the register in which the branch destination 
address has been loaded is changed by the delay slot instruction, the branch will still be made 
using the value of the register prior to the change as the branch destination address. 


Table 4.2. Delayed Branch Instructions 


SH-1/SH-2 CPU Description Example for Other CPU 
BRA TRGET 7 Executes an ADD before ADD .W R1,R0 
ADD R1,RO branching to TRGET. 


BRA TRGET 


4.1.6 Miultiplication/Accumulation Operation 


SH-1 CPU: 16bit x 16bit — 32-bit multiplication operations are executed in one to three cycles. 
16bit xX 16bit + 42bit > 42-bit multiplication/accumulation operations are executed in two to three 
cycles. 


SH-2 CPU: 16bit x 16bit — 32-bit multiplication operations are executed in one to two cycles. 
16bit x 16bit + 64bit — 64-bit multiplication/accumulation operations are executed in two to three 
cycles. 32bit x 32bit > 64-bit multiplication and 32bit x 32bit + 64bit > 64-bit 
multiplication/accumulation operations are executed in two to four cycles. 


4.1.7 TBit 


The T bit in the status register changes according to the result of the comparison, and in turn is the 
condition (true/false) that determines if the program will branch (table 4.3). The number of 
instructions after T bit in the status register is kept to a minimum to improve the processing speed. 


Table 4.3 T Bit 


_ SH-1/SH-2 CPU Description Example for Other CPU 
CMP/GE _R1,RO T bitis set when RO2R1.The CMP.W — R1,RO 
BT TRGETO program branches to TRGETO BGE TRGETO 
when RO = R1 and to TRGET1 | 
BF TRGET1 when RO<R1. BLT TRGET1 
ADD #—1,RO T bitis not changed by ADD. T ~— SUB..W #1,R0 
CMP/EQ #0, RO bit is set when RO = 0. The BEO TRGET 


program branches if RO = 0. 
BT TRGET 


4.1.8 Immediate Data 


Byte immediate data is located in instruction code. Word or longword immediate data is not input 
via instruction codes but is stored in a memory table. The memory table is accessed by an 
immediate data transfer instruction (MOV) using the PC relative addressing mode with 
displacement (table 4.4). | 


Table 4.4 Immediate Data Accessing 


Classification SH-1/SH-2 CPU Example for Other CPU 
8-bit immediate MOV #H'12,RO0 MOV.B #H'12,R0 
16-bit immediate MOV .W @(disp, PC) ,RO MOV.W #H'1234,R0 


.DATA.W H'1234 
32-bit immediate MOV .L @ (disp, PC) , RO MOV.L #H'12345678,R0 


eeoeo3weeee fe © © @ ee 8H ee Oo 


.DATA.L H'12345678 
Note: The address of the immediate data is accessed by @(disp, PC). 


4.1.9 Absolute Address 


When data is accessed by absolute address, the value already in the absolute address is placed in 
the memory table. Loading the immediate data when the instruction is executed transfers that 
value to the register and the data is accessed in the indirect register addressing mode. 


Table 4.5 Absolute Address 


Classification SH-1/SH-2 CPU Example for Other CPU 
Absolute address MOV.L § @(disp,PC),R1 MOV.B @H'12345678,R0 
MOV .B @R1,RO 


oeoeeee ee ee ee ee oe ee 


-DATA.L H'12345678 





4.1.10 16-Bit/32-Bit Displacement 


When data is accessed by 16-bit or 32-bit displacement, the pre-existing displacement value is 
placed in the memory table. Loading the immediate data when the instruction is executed transfers 
that value to the register and the data is accessed in the indirect indexed register addressing mode. 


Table 4.6 Displacement Accessing 


Classification SH-1/SH-2 CPU | _ Example for Other CPU 
16-bit displacement  Mov.W @(disp, PC) , RO MOV.W  @(H'1234,R1),R2 
MOV.W @(RO,R1) ,R2 


ecoeoeeeeeee ef Be © © ee wB oo @ 


-_DATA.W H'1234 


4.2 Addressing Modes 


Addressing modes and effective address calculation are described in table 4.7. 


Table 4.7. Addressing Modes and Effective Addresses 


Addressing Instruction 








Mode Format Effective Addresses Calculation Formula 
Direct Rn The effective address is register Rn. (The operandis — 
register » the contents of register Rn.) 
addressing | 
Indirect -@Rn The effective address is the content of register Rn. Rn 
register 
Post- @Rn + The effective address is the content of register Rn. A Rn 
increment constant is added to the content of Rn after the (After the 
indirect instruction is executed. 1 is added for a byte instruction is 
register operation, 2 for a word operation, or 4 for a longword executed) 
addressing operation. | 
Byte: Rn + 1 
— Rn 
Word: Rn + 2 
— Rn 
Longword: 
Rn.+4-— Rn 
Pre- @-—Rn The effective address is the value obtained by Byte: Rn-1- 
decrement subtracting a constant from Rn. 1 is subtracted fora — Rn 
indirect byte operation, 2 fora word operation, or 4 fora Word: Rn —2 
register longword operation. Rn 
addressing 
Longword: 
Rn-4-— Rn 
(Instruction 
executed 
with Rn after 
calculation) 
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Table 4.7 


Addressing Instruction 


Format 
@(disp:4, 


Mode 


Indirect 
register 
addressing 
with 
displace- 
ment 


Indirect 
indexed 
register 
addressing 


@(RO, Rn) 


Addressing Modes and Effective Addresses (cont) 


Effective Addresses Calculation 


The effective address is Rn plus a 4-bit displacement 


(disp). The value of disp is zero-extended, and 


remains the same for a byte operation, is doubled for 


a word operation, or is quadrupled for a longword 
operation. 










Rn 
+ disp x 1/2/4 


disp 
(zero-extended 


The effective address is the Rn value plus RO. 


G@)—>_Rn+ RO 





Formula 


Byte: Rn + 
disp 


Word: Rn + 
disp x 2 


Longword: 
Rn + disp x 4 


Rn + RO 


ooo 


Indirect @(disp:8, 
GBR GBR) 
addressing 

with 

displace- 

ment 


Indirect 
indexed 
GBR 
addressing 


@(RO, 
GBR) 


The effective address is the GBR value plus an 8-bit 


displacement (disp). The value of disp is zero- 
extended, and remains the same for a byte 
operation, is doubled for a word operation, or is 
quadrupled for a longword operation. 


disp 
(zero-extended) 









GBR 
+ disp x 1/2/4 











The effective address is the GBR value plus RO. 


GBR + RO 
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Byte: GBR + 
disp 


Word: GBR + 
disp x 2 


Longword: 
GBR + disp x 
4 


GBR + RO 


Table 4.7 Addressing Modes and Effective Addresses (cont) 


Addressing Instruction 
Mode _ Format 
PC relative @(disp:8, 
addressing PC) 
with 

displace- 

ment 


PC relative  disp:8 
addressing 


disp:12 


_ Effective Addresses Calculation ) Formula 


The effective address is the PC value plus an 8-bit | Word: PC + 
displacement (disp). The value of disp is zero- disp x 2 
extended, and disp is doubled for a word operation, Longword: 
or is quadrupled for a longword operation. Fora C& 


longword operation, the lowest two bits of the PC are jweerererc 


masked. + disp x 4 







(for longword) 


PC + disp x 2 
or 
PC&H'FFFFFFFC 
+ disp x 4 















disp 
(zero-extended) 


The effective address is the PC value sign-extended PC + disp x 2 
with an 8-bit displacement (disp), doubled, and 
added to the PC. 


PC + disp x 2 








The effective address is the PC value sign-extended PC +disp x 2 
with a 12-bit displacement (disp), doubled, and 
added to the PC. 


PC 





PC + disp x2] 





disp 
(sign-extended) 
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Table 4.7 Addressing Modes and Effective Addresses (cont) 


Addressing Instruction 
Mode Format Effective Addresses Calculation Formula 


PC relative Rn The effective address is the register PC plus Rn. PC + Rn 


addressing 
(cont) 


[Pos Ro 


Immediate #imm:8 The 8-bit immediate data (imm) for the TST, AND, — 
addressing OR, and XOR instructions are zero-extended. 
#imm:8 The 8-bit immediate data (imm) forthe MOV, ADD, — 


and CMP/EQ instructions are sign-extended. 


#imm:8 Immediate data (imm) for the TRAPA instructionis — 
zero-extended and is quadrupled. 


4.3 Instruction Format 


The instruction format table, table 4.8, refers to the source operand and the destination operand. 
The meaning of the operand depends on the instruction code. The symbols are used as follows: 


xxxx: Instruction code 
mmmm: Source register 
nnnn: Destination register 
1111: Immediate data 

dddd: Displacement 


Table 4.8 Instruction Formats 


Source Destination 
instruction Formats Operand Operand Example 
0 format — — NOP 
15 0 
n format — nnnn: Direct MOVT Rn 
| register 
15 0 Control register = nnnn: Direct STS MACH,Rn 
register 
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-Table 4.8 Instruction Formats (cont) 


Instruction Formats 


n format (cont) 


m format 


Source 
‘Operand — 


Control register 
or system 
register 


mmmm: Direct 


Destination 
Operand 

nnnn: Indirect pre- 
decrement 
register 


Example 
STC.L SR,@-Rn 


Control register or 


1 


5 


HT 


XXXX  XXXX 


register 


mmmm: Indirect 
post-increment 
register 


mmmm: Direct 
register 


mmmm: PC 


system register 


Control register or 
system register 


LDC Rm,SR 


LDC.L @Rm+,SR 


relative using Rm 


nm format 


All 


XXXX 


md format 
15 

nd4 format | 
15. 

Note: 


0 
nnnn |mmmmy} xxxx 


dddd 


Al 


Q 


In multiply/accumulate instructions, nnnn is the source register. 


mmmm: Direct 
register 


mmmm: Direct 
register 


mmmm: Indirect 
post-increment 
register (multiply/ 
accumulate) 
nnnn*: Indirect 
post-increment 
register (multiply/ 
accumulate) 
mmmm: Indirect 


post-increment 
register 


mmmm: Direct 
_register 


mmmm: Direct 
register 


mmmmdddd: 
indirect register 
with 
displacement 
RO (Direct 
register) 
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nnnn: Direct 
register 


nnnn: Indirect 
register 


MACH, MACL 


nnnn: Direct 
register 


nnnn: Indirect pre- 
decrement 
register 

nnnn: Indirect 
indexed register 


RO (Direct 
register) 


nnnndddd: 
Indirect register 
with displacement 


JMP @Rm 
BRAF Rm 
ADD Rm, Rn 


MOV.L Rm, @Rn 


MAC .W 
@Rm+, @Rn+ | 


MOV.L @Rm+t,Rn 
MOV.L Rm, @-Rn 


MOV.L 
Rm, @(RO,Rn) 


MOV .B 
@( disp, Rm) ,RO 


MOV .B 
RO, @(disp,Rn) 


Table 4.8 Instruction Formats (cont) 


Source Destination 
Instruction Formats Operand Operand Example 
nmd format mmmm: Direct = nnnndddd: Indirect MOV.L 
15 0 register register with Rm, @(disp, Rn) 
mmmmdddd: nnnn: Direct MOV.L 
Indirect register register @(disp, Rm),Rn 
with 
displacement 
d format dddddddd: RO (Direct register) MOV.L 
15 Q ‘Indirect GBR @(disp,GBR) , RO 
displacement 
RO(Direct dddddddd: Indirect MOV.L 
register) GBR with RO, @(disp, GBR) 


displacement 
dddddddd: PC __— RO (Direct register) MOvA 


relative with @(disp,PC),RO 
displacement 
dddddddd: PC — BF label 
relative 
d12 format dddddddddddd: — BRA label 
15 Q  PCrelative (label = disp + 
dddd dddd ddd PC) 
nd8 format dddddddd: PC _nnnn: Direct MOV.L 
15 0 relative with register @(disp,PC),Rn 
dddd  dddd displacement 
| format liililii: Immediate Indirect indexed AND .B 
GBR #imm, @(RO,GBR) 
15 QO _iiiiiiii: Immediate RO (Direct register) AND #imm, RO 
[eve [inti 
. lillilii: Immediate — TRAPA #imm 
ni format liiliiliz Immediate nnnn: Direct ADD #imm, Rn 
15 0 register 


XXXX | NANN 
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Section 5 Instruction Set 


5.1 Instruction Set by Classification 


Table 5.1 lists instructions by classification. 


Table 5.1 Classification of Instructions 


Applicable 
Instructions 
Operation No. of 
Classification Types Code Function SH-2 SH-1_ Instructions 
Datatransfer 5 MOV Data transfer Y v 39 
Immediate data transfer 
Peripheral module data transfer 
Structure data transfer 
MOVA Effective address transfer vY Y 
MOVT T-bit transfer vY v 
SWAP Swap of upper and lower bytes vo v 
XTRCT Extraction of the middle of Y Vv 
registers connected 
Arithmetic 21 ADD Binary addition vy v 33 
operations ADDC Binary addition with carry Yev 
ADDV Binary addition with overflow v v 
check 
CMP/cond Comparison Y Y 
DIV1 Division Y v 
DIVOS Initialization of signed division v vY 
DIVOU Initialization of unsigned v vY 
division 
DMULS Signed double-length VY 
multiplication 
DMULU Unsigned double-length v 
multiplication 
DT Decrement and test Y 
EXTS Sign extension VY Y 
EXTU Zero extension v vY 
MAC Multiply/accumulate, double- vY v 
length multiply/accumulate 
operation’ 
MUL Double-length multiplication Y 
MULS Signed multiplication v v 
MULU Unsigned multiplication Y Y 
NEG Negation v V 
NEGC Negation with borrow vY Y 
SUB Binary subtraction vY Y 
SUBC Binary subtraction with borrow wo Y 
SUBV Binary subtraction with vY Y 


underflow check 


Notes 1. Double-length multiply/accumulate is an SH-2 function. 
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Table 5.1 Classification of Instructions (cont) 


Applicable 


— Instructions | 
Operation | No. of 
Classification Types Code Function SH-2 SH-1 Instructions 
Logic 6 AND Logical AND Vv v 14 
operations NOT ___ Bit inversion Yev 
OR Logical OR vv 
TAS Memory test and bit set V v 
TST ~~ ~—_~_Logical AND and T-bit set v vY 
XOR Exclusive OR v v 
Shift 10 ROTL One-bit left rotation _ v v 14 
~ ROTR —_One-bit right rotation vev 
ROTCL One-bit left rotation with T bit v vy 
ROTCR  One-bit right rotation with T bit wv v 
SHAL One-bit arithmetic left shift Yv- 
SHAR — One-bit arithmeticright shift vVvov 
SHLL  —. One-bit logical left shift Yev 
SHLLn n-bit logical left shift Y v 
SHLR One-bit logical right shift v V 
SHLRn _n-bit logical right shift Yev 
Branch 9 BF Conditional branch, conditional Yo v 11 
branch with delay*2 (T = 0) | 
BT Conditional branch, conditional wv Y 
branch with delay*2 (T = 1) 
BRA Unconditional branch v v 
BRAF Unconditional branch v 
BSR Branch to subroutine procedure ¥ Vv 
BSRF Branch to subroutine procedure 
JMP ~ Unconditional branch v v 
JSR_ Branch to subroutine procedure _¥ v 
RTS Return from subroutine v v 
procedure 


Notes 2. Conditional branch with delay is an SH-2 CPU function. 
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Table 5.1 Classification of Instructions (cont) 


Applicable 
Instructions 
Operation No. of 
Classification Types Code Function SH-2 SH-1_ Instructions 
System 11 CLRT T-bit clear vY v 31 
control CLRMAC MAC register clear Yev 
LDC Load to control register vY v 
LDS Load to system register vY v 
NOP No operation VY v 
RTE Return from exception v vY 
processing _ 
SETT T-bit set v v 
SLEEP Shift into power-down mode W v 
STC Storing control registerdata wv v 
STS Storing system registerdata VW vo 
TRAPA Trap exception processing v Y 
Total: 62 142 
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Instruction codes, operation, and execution states are listed in table 5.2 in order by classification. 


Table 5.2. Instruction Code Format 


Item Format 


Instruction OP.Sz SRC,DEST 
mnemonic _ 


Instruction MSB © LSB 
code 


Operation ae 
summary (xx) 
M/Q/T 
& 
| 
A 
<<n, >>n 
Execution 
cycle 
Instruction 
execution 
cycles 
T bit 


Explanation 

OP: Operation code 
$z: Size 

SRC: Source 


_ DEST: Destination 


Rm: Source register 
Rn: Destination register 
imm: Immediate data 
disp: Displacement* 


mmmm: Source register 
nnnn: Destination register 
0000: RO 
0001: R1 
1111: R15 
iii: Immediate data 
dddd: Displacement 


Direction of transfer 
Memory operand | 

Flag bits in the SR 
Logical AND of each bit 
Logical OR of each bit 
Exclusive OR of each bit 
Logical NOT of each bit 
n-bit left/right shift 


Value when no wait states are inserted 


The execution cycles shown in the table are minimums. 
The actual number of cycles may be increased: 


1. When contention occurs between instruction fetches 
and data access, or 


2. When the destination register of the load instruction 
(memory — register) and the register used by the next 
instruction are the same. 


Value of T bit after instruction is executed 


No change 


Note: Scaling (x1, x2, x4) is performed according to the instruction operand size. See "6. 
Instruction Descriptions" for details. 
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5.1.1 


Tables 5.3 to 5.8 list the minimum number of clock states required for execution. 


Table 5.3 


Instruction 
MOV #imm, Rn 


MOV.W @(disp,PC),Rn 


MOV.L @(disp, PC) ,Rn 
MOV Rm, Rn 

Rm, @Rn 

Rm, @Rn 

Rm, @Rn 


@Rm, Rn 


MOV .B 
MOV .W 
MOV .L 
MOV .B 


MOV.W @Rm,Rn 


MOV .L 
MOV .B 
MOV .W 
MOV.L 
MOV .B 


@Rm, Rn 

Rm, @—Rn 
Rm, @—Rn 
Rm, @—Rn 
@Rm+ , Rn 


MOV.W @Rm+,Rn 


MOV.L 
MOV .B 
MOV .W 
MOV .L 
MOV .B 


@Rm+, Rn 

RO, @ (disp, Rn) 
RO, @(disp, Rn) 
Rm, @(disp, Rn) 
@(disp, Rm) ,RO 


MOV.W @(disp,Rm) ,RO 


MOV .L 
MOV.B 
MOV .W 


@(disp,Rm) ,Rn 
Rm, @(RO, Rn) 
Rm, @(RO, Rn) 


Data Transfer Instructions 


Data Transfer Instructions 


Instruction Code 


1001nnnndddddddd 


1101nnnndddddddd 
0110nnnnmmmm0 011 
0010nnnnmmmmd 000 
0010nnnnmmmm0001 
0010nnnnmmmm0010 


0110nnnnmmmm0000 
0110nnnnmmmm0001 


0110nnnnmmmm0010 
0010nnnnmmnmm0100 
0010nnnnmmmm0101 
0010nnnnmmmm0110 


0110nnnnmmmm0100 
0110nnnnmmmm0 101 


0110nnnnmmmm0110 
10000000nnnndddd 
10000001nnnndddd 
0001nnnnmmmmdddd 
10000100mmmmdddd 


10000101mmmmdddd 


0101nnnnmmmmdddd 
0000nnnnmmmm0 100 
0000nnnnmmmmn0101 
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Operation 


imm — Sign extension > 
Rn 


(disp x 2 + PC) > Sign 
extension > Rn 


(disp x 4+ PC) > Rn 
Rm — Rn 

Rm — (Rn) 

Rm — (Rn) 

Rm — (Rn) 


(Rm) — Sign extension > 
Rn 


(Rm) — Sign extension > 
Rn 


(Rm) > Rn 

Rn—1 —> Rn, Rm > (Rn) 
Rn-—2 — Rn, Rm — (Rn) 
Rn—4 — Rn, Rm — (Rn) 


(Rm) — Sign extension > 
Rn,Rm + 1—- Rm 


(Rm) — Sign extension > 
Rn,Rm +2 — Rm 


(Rm) > Rn,Rm + 4-— Rm 
RO — (disp + Rn) 

RO — (disp x 2 + Rn) 

Rm — (disp x 4 + Rn) 


(disp + Rm) — Sign 
extension — RO 


(disp x 2 + Rm) — Sign 
extension — RO 


(disp x 4+ Rm) > Rn 
Rm — (RO + Rn) 
Rm — (RO + Rn) 


Execu- 
tion T 
State Bit 


1 = 


ne ee ee ee | eS ee ee ee ee | 


ee i ee eS 





Table 5.3 

Instruction 

MOV.L Rm,@(RO,Rn) 
MOV .B @(RO,Rm) ,Rn 
MOV.W  @(RO,Rm),Rn 
MOV .L @(RO,Rm),Rn 
MOV.B  R0O,@(disp,GBR) 
MOV.W  RO,@(disp,GBR) 
MOV.L RO,@(disp, GBR) 
MOV.B @(disp,GBR) ,RO 
MOV .W @ (disp, GBR) ,RO 
MOV.L @(disp,GBR) , RO 
MOVA @(disp, PC) , RO 
MOVT Rn 

SWAP.B Rm,Rn 

SWAP.W Rm,Rn > 

XTRCT Rm, Rn 


Data Transfer Instructions (cont) 


Instruction Code 
0000nnnnmmmm0110 
0000nnnnmmmm1100 


0000nnnnmmmm1101 


0000nnnnmmmm1110 
11000000dddddddd 
11000001ddddddda 
11000010ddddddda 
11000100dddddddd 


11000101dddddddd 


11000110dddddddd 
11000111dddddddd 
0000nnnn00101001 
0110nnnnmmmm1000 


011 0nnnnmmnmm1001 


0010nnnnmmmm1101 
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Operation 
Rm — (RO + Rn) 


(RO + Rm) — Sign 
extension — Rn 


(RO + Rm) > Sign 
extension —> Rn 


(RO + Rm) > Rn 

RO — (disp + GBR) 
RO — (disp x 2 + GBR) 
RO — (disp x 4+ GBR) 


(disp + GBR) — Sign 
extension — RO 


(disp x 2 + GBR) > Sign 
extension — RO 


(disp x 4 + GBR) — RO 
disp x 4+ PC > RO 
T— Rn 


Rm — Swap upper and 
lower 2 bytes— Rn 


Rm — Swap upper and 
lower word > Rn 


Center 32 bits of Rm and 
Rn—- Rn | 


Execu- 
tion 
State 


ee ee eS 


oma, amd, as + oh, 


T 
Bit 


5.1.2 Arithmetic Instructions 


Table 5.4 
Instruction 

ADD Rm, Rn 
ADD #imm, Rn 
ADDC Rm, Rn 
ADDV Rm, Rn 
CMP/EQ #imm, RO 
CMP/EQ  Rm,Rn 
CMP/HS  Rm,Rn 
CMP/GE  Rm,Rn 
CMP/HI Rm, Rn 
CMP/GT  Rm,Rn 


CMP/PL Rn 


CMP/ PZ Rn 
CMP/STR Rm,Rn 
DIV1 Rm, Rn 
DIVOS Rm, Rn 
DIVOU 


Arithmetic Instructions 


Instruction Code 
0011nnnnmmmm1100 


0011nnnnmmmm1110 


0011nnnnmmmm1111 


0011nnnnmmmm0 000 


0011nnnnmmmm0 010 


0011nnnnmmmm0011 


0011innnnmmmm0110 


0011nnnnmmmm0111 


0100nnnn00010101 


0100nnnn00010001 


0010nnnnmmmm1100 


0011nnnnmmmm0100 


0010nnnnmmmm0111 


0000000000011001 


Operation 
Rn + Rm —> Rn 
Rn + imm —> Rn 


Rn +Rm+T-— Rn, 
Carry > T 


Rn + Rm — Rn, 
Overflow > T 


If RO=imm, 1 > T 


If Rn = Rm, 1 > T 


If Rn=Rm with 
unsigned data, 1 > T 


If Rn > Rm with 
signed data, 1 > T 


If Rn > Rm with 
unsigned data, 1 > T 


lf Rn > Rm with 
signed data, 1 > T 


IfRn>0,15T 


If Rn 20,1 5T 


If Rn and Rm have an 
equivalent byte, 1 > 
T 

Single-step division 
(Rn/Rm) 

MSB of Rn > Q, 


MSB of Rm > M, M4 
Q>T 


0 > M/Q/T 
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Execution 
State 


1 
1 
1 


T Bit 


Carry 
Overflow 


Compariso 
n result 


Compariso 
n result 


Compariso 
n result 


Compariso 
n result 


Compariso 
n result 


Compariso 
n result 


Compariso 
n result 


Compariso 
n result 


Compariso 
n result 


Calculation 
result 


Calculation 
result 


Table 5.4 Arithmetic Instructions (cont) 


Instruction 
DMULS.L Rm,Rn*? 


DMULU.L Rm, Rn*2 


DT Rn*2 
EXTS.B  Rm,Rn 
EXTS.W  Rm,Rn 
EXTU.B  Rm,Rn 
EXTU.W  Rm,Rn 
MAC.L @Rm+, @Rn+ 

a2 
MAC .W @Rm+, @Rn+ 
MUL.L Rm, Rn*2 
MULS.W  Rm,Rn 
Notes: 1. 


Instruction Code 
0011nnnnmmmm1i101 


0011nnnnmmmm0101 


0100nnnn00010000 


011 0nnnnmmmm1110 


011 0nnnnmmmm1111 


0110nnnnmmmm1100 


0110nnnnmmmm1101 


0000nnnnmmmm1111 


0100nnnnmmmm1111 


0000nnnnmmmm0111 


0010nnnnmmmm1111 | 


Operation 


Signed operation of 
Rn x Rm — MACH, 
MACL | 


32 x 32 — 64 bits 


Unsigned operation 
of Rn x Rm — MACH, 
MACL 


32 x 32 > 64 bits 


Rn - 1 — Rn, when 
Rn is 0, 1 — T. When 
Rn is nonzero, 0 > T 


A byte in Rm is sign- 
extended — Rn 


A word in Rm is sign- 
extended — Rn 


A byte in Rm is zero- 
extended — Rn 


A word in Rm is zero- 
extended — Rn 


Signed operation of 
(Rn) x (Rm) + MAC 
— MAC 


32 x 32 + 64— 64 bits 


Signed operation of 
(Rn) x (Rm) + MAC 
— MAC 

(SH-2 CPU) 16 x 16 + 
64 —> 64 bits 


(SH-1 CPU) 16 x 16 + 
42 — 42 bits 


Rn x Rm > MACL, 
32 x 32 > 32 bits 


Signed operation of 
RnxRm— MAC - 


16 x 16 — 32 bits 


Execution 
State T Bit 


2 to 4*! — 


2 to 4*! — 


1 Compariso 
n result 


3/(2 to 4)*! — 


3/(2)*" — 


2 to 4*! — 


1 to 3”! — 


The normal minimum number of execution states (The number in parentheses is the 


number of states when there is contention with preceding/following instructions) 
2. SH-2 CPU instructions 
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Table 5.4 Arithmetic Instructions (cont) 
Execution 
Instruction Instruction Code Operation State T Bit 
MULU.W  Rm,Rn 0010nnnnmmmm1110 Unsigned operation _—1 to 3*! — 
of Rn x Rm — MAC 
16 x 16 — 32 bits 
NEG Rm, Rn 0110nmnnnmmmm1011 jO-Rm—-Rn 1 — 
NEGC Rm, Rn 0110nnnnmmmm1010 jj. O0—Rm-T — Rn, 1 Borrow 
Borrow > T 
SUB Rm, Rn 0011nnnnmmmm1000  Rn—Rm—> Rn 1 — 
SUBC Rm, Rn 0011nnnnmmmm1010 j§ Rn—Rm-T — Rn, 1 Borrow 
Borrow > T 
SUBV Rm, Rn 0011nnnnmmmm1011 Rn—Rm— Rn, 1 Underflow 
Underflow > T 
Notes: 1. The normal minimum number of execution states (The number in parentheses is the 
number of states when there is contention with preceding/following instructions) 
5.1.3. Logic Operation Instructions 
Table 5.5 Logic Operation Instructions 
Execution 
Instruction Instruction Code Operation State T Bit 
AND Rm, Rn 0010nnnnmmmm1001 j$Rn&Rm-—-Rn 1 — 
AND #imm, RO 110010011111111i1 RO & imm — RO 1 a 
AND.B #imm,@(RO,GBR) 11001101iiiiiiii (RO+GBR)&imm—-_ 3 — 
(RO + GBR) | 
NOT Rm, Rn 0110nnnnmmmm0111 $~Rm-— Rn 1 — 
OR Rm, Rn 0010nnnnmmmm1011 Rn! Rm— Rn 1 — 
OR #imm, RO 11001011i11131111 RO | imm —> RO 1 — 
OR.B  #imm,@(RO,GBR) 11001111iiiiiiii (RO+GBR)limm> 3 — 
(RO + GBR) 
TAS.B  @Rn 0100nnnn00011011 If(Rn)isO,1-~>T;1> 4 Test 
MSB of (Rn) result 
TST Rm, Rn 0010nnnnmmmm1000 Rn&Rm; if the resultis 1 Test 
0,1->T result 
TST #imm, RO 11001000iiiiiiii RO & imm; if the result 1 Test 
isO, 1 ~T result 
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Table 5.5 


SHLR16 


Logic Operation Instructions (cont) 
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Execution 
Instruction instruction Code Operation State —T Bit 
TST.B #imm,@(RO,GBR)  11001100iiiiiiii | (RO+ GBR) &imn: if 3 Test 
the result is 0, 1 > T result 
XOR Rm, Rn 0010nnnnmmmm1010 Rn*Rm—-Rn 1 — 
XOR #imm, RO 1100101011111111 RO“ imm — RO 1 — 
XOR.B #imm,@(RO,GBR) 11001110iiiiiiii (RO+GBR)“’imm—> _— 3 — 
(RO + GBR) 
5.1.4 Shift Instructions 
Table 5.6 Shift Instructions 
Instruction _ Instruction Code Operation Execution State T Bit 
ROTL Rn 0100nnnn00000100 T<¢-Rn¢«MSB 1 MSB 
ROTR Rn 0100nnnn00000101 LSB—+Rn-T 1 LSB 
ROTCL Rn 0100nnnn00100100 T¢eRneT 1 MSB 
ROTCR Rn 0100nnnn00100101 T—-3Rn->T 1 LSB 
SHAL Rn 0100nnnn00100000 + T<-Rn¢-O 1 MSB 
SHAR Rn 0100nnnn00100001 MSB —> Rn > T 1 LSB 
SHLL Rn 0100nnnn00000000 T«Rn¢O 1 MSB 
SHLR Rn 0100nnnn00000001 O-Rn-T 1 LSB 
SHLL2 Rn 0100nnnn00001000 Rn<<2 — Rn 1 — 
SHLR2 Rn 0100nnnn00001001. #Rn>>2—> Rn 1 — 
SHLL8 Rn 0100nnnn00011000 Rn<<8— Rn 1 — 
SHLR8 Rn 0100nnnn00011001 Rn>>8—> Rn 1 — 
_ SHLL16 Rn 0100nnnn00101000 Rn<<16 — Rn 1 — 
Rn 0100nnnn00101001 #Rn>>16 > Rn 1 — 





5.1.5 
Table 5.7 


Instruction 
BF label 


BF/S label *? 


BT label 


BT/S label*? 


BRA label 


BRAF Rm*? 


BSR label 


BSRF Rm*? 


JMP @Rm 


JSR @Rm 


RTS 


Branch Instructions 


Branch Instructions 


Instruction Code 

10001011ddddddda 
10001111dddddddd 
10001001dddddddd 
10001101dddddddd 
1010dddddddddddda 


0000mmmm00100011 
1011dddddddddddd 


0000mmmm00000011 


0100mmmm00101011 
0100mmmm00001011 


0000000000001011 


Notes: 2. SH-2 CPU instruction 
3. One state when it does not branch 


Operation 


lf T =0, disp x2 + PC — PC; if T= 
1, nop (where label is disp x 2 + 
PC) 


Delayed branch, if T = 0, disp x 2 + 
PC > PC; if T=1, nop 


If T =1, disp x2 + PC — PC; if T= 
0, nop (where label is disp + PC) 


Delayed branch, if T = 1, disp x 2 + 
PC — PC; if T =0, nop 


Delayed branch, disp x2 + PC > 
PC 


Delayed branch, Rm + PC > PC 


Delayed branch, PC — PR, disp x 2 
+ PC — PC 


Delayed branch, PC — PR, Rm + 
PC — PC 


Delayed branch, Rm — PC 


Delayed branch, PC > PR, Rm > 
PC 


Delayed branch, PR — PC 


af 


Execution 
State T Bit 


3/1*3 — 


2i1*8 — 


3/1*3 — 


2/1*8 —_ 


5.1.6 


System Control Instructions 


Table 5.8 System Control Instructions 


Instruction 

CLRT | 
CLRMAC 

LDC Rm, SR 
LDC Rm, GBR 
LDC Rm, VBR 
@Rm+,SR 
@Rm+,GBR 
@Rm+,VBR 
LDS Rm, MACH 
LDS Rm, MACL 
LDS Rm, PR 
@Rm+, MACH 


LDS.L @Rm+,MACL 


LDS.L @Rm+,PR 
“NOP 


RTE 


SETT 

SLEEP 

STC SR, Rn 
STC GBR, Rn 
STC VBR, Rn 
STC.L SR,@-Rn 
STC.L GBR, @-—Rn 
STC.L VBR, @-Rn 
STS MACH, Rn 
STS MACL, Rn 


STS PR, Rn 


Instruction Code 


0000000000001000 | 
0000000000101000 © 
0100mmmm00001110 


0100mmmm00011110 
0100mmmm00101110 
0100mmmm00000111 
0100mmmm00010111 
0100mmmm00100111 
0100mmmm00001010 


0100mmmm00011010 


0100mmmm00101010 
0100mmmm00000110 


0100mmmm00010110 


0100mmmm00100110 
0000000000001001 
0000000000101011 


0000000000011000 
0000000000011011 
0000nnnn00000010 
0000nnnn00010010 
0000nnnn00100010 
0100nnnn00000011 
0100nnnn00010011 
0100nnnn00100011 
0000nnnn00001010 
0000nnnn00011010 
0000nnnn00101010 


Operation 

O->T . 

0 — MACH, MACL 

Rm —> SR 

Rm — GBR 

Rm —> VBR 

(Rm) > SR, Rm+4-— Rm 
(Rm) ~ GBR, Rm+4-— Rm 
(Rm) > VBR, Rm+4— Rm 
Rm — MACH | 

Rm > MACL 


Rm—- PR 


(Rm) ~ MACH, Rm+4—> 
Rm 


(Rm) > MACL, Rm+4- 
Rm , 


(Rm) — PR, Rm+4— Rm 
No operation 


Delayed branch, stack area > 
PC/SR 


ToT 

Sleep 

SR — Rn 

GBR — Rn 

VBR —> Rn 

Rn—4 > Rn, SR = (Rn) 
Rn—4 — Rn, GBR —> (Rn) 
Rn—4 — Rn, VBR — (Rn) 
MACH — Rn 

MACL — Rn 

PR — Rn 
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Execution T 


State 
, 


(e) ah 
* 
as 


— | | st | PQ] NT NO] |] ss] 


Bit 





Table 5.8 


Instruction Instruction Code 

STS.L MACH, @-Rn 0100nnnn00000010 
STS.L MACL,@—-Rn 0100nnnn00010010 
STS.L PR,@-Rn 0100nnnn00100010 
TRAPA #imm 1100001111111111 


5.2 


System Control Instructions (cont) 


Operation 
Rn—4 — Rn, MACH —> (Rn) 1 
Rn—4 — Rn, MACL - (Rn) 
Rn—4 > Rn, PR - (Rn) 


PC/SR — stack area, (imm x 


4+ VBR) > PC 
Notes: 4. The number of execution states before the chip enters the sleep state 


Executio T 
n State Bit 


1 
1 
8 


The above table lists the minimum execution cycles. In practice, the number of execution 
cycles increases when the instruction fetch is in contention with data access or when the 
destination register of a load instruction (memory — register) is the same as the register 

used by the next instruction. 


Instruction Set in Alphabetical Order 


Table 5.9 alphabetically lists instruction codes and number of execution cycles for each 
instruction. 


Table 5.9 


instruction 

ADD #imm, Rn 

ADD Rm, Rn 

ADDC Rm,Rn 

ADDV kRm,Rn 

AND #imm, RO 

AND Rm, Rn 

AND.B #imm,@(RO,GBR) 
BF label 

BF/S  label*? 


Instruction Set 


Instruction Code 


001L1innnnmmmm1100 
0011innnnmmmm1110 


0011nnnnmmmm1111 


10001011dddddddd 


10001111dddddddd 
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Operation 
Rn +imm — Rn 
Rn + Rm — Rn 


Rn + Rm+T— Rn, 
Carry > T 


Rn + Rm => Ro, 
Overflow > T 


RO & imm —> RO 
Rn & Rm —- Rn 
(RO + GBR) & imm 
— (RO + GBR) 


lf T =0, disp x2 + 
PC > PC; if T= 1, 
nop 

lfT =0, disp x 2+ 


PC + PC; if T=1, 
nop 


Execu- 
tion 
State 


1 


ioe) 


3/1*3 


2/1*3 


T Bit 


Carry 


Overflow 


Table 5.9 


Instruction Set (cont) 


Notes: 2. SH-2 CPU instructions 





Execu- 
| 7 tion 
Instruction _ Instruction Code Operation State T Bit 
BRA label 1010ddddddddddad Delayed branch, 20 — 
disp x2 + PC > 
PC 
BRAF Rm*2 0000mmmm00100011 Delayed branch, 2 — 
A Rm + PC > PC 
BSR label 1011dddddddddddd Delayed branch, 2 — 
PC — PR, disp x 2 
+PC-— PC 
BSRF Rm*2 0000mmmm00000011 Delayed branch, 2 — 
PC — PR, Rm + 
PC — PC 
BT label 10001001dddddddd lf{T=1,disp x2+  3/1*8 — 
PC > PC; if T =0, 
nop 
BT/S label *? 10001101dddddddd lf{T=1,dispx2+ 21% — 
PC — PC; if T=0, 
nop 
CLRMAC 0000000000101000 0- MACH, MACL 1 cis 
CLRT 0000000000001000 0O->T 1 0 
CMP/EQ #imm,RO 10001000iiiiiiii IfRO=imm,1—3T 1 Comparison 
: | result 
CMP/EQ Rm,Rn 0011nnnnmmmm0000 IfRn=Rm,1>T 1 Comparison 
result 
CMP/GE Rm,Rn 0011nnnnmmmm0011 If Rn = Rm with 1 Comparison 
| signed data, 1 > T result 
CMP/GT Rm,Rn 0011nnnnmmmm0111 If Rn > Rm with 1 Comparison 
signed data, 1 > T result 
CMP/HI Rm,Rn 0011nnnnmmmm0110 If Rn > Rm with 1 Comparison 
unsigned data, result 
1oT 
CMP/HS Rm,Rn 0011nnnnmmmm0010 lf Rn => Rm with 1 Comparison 
| unsigned data, result 
| ToT 
CMP/PL Rn 0100nnnn00010101 If Rn>0, 1 > T 1 Comparison 
result 
CMP/PZ Rn 0100nnnn00010001 lf Rn 20,1 3T 1 Comparison 
result 


3. One state when it does not branch 


Table 5.9 


Instruction 


CMP/STR km,Rn 


Rm, Rn 


DIVOS 


DIVOU 


DIV1 Rm, Rn 


DMULS.L Rm,Rn*? 


Rm, Rn*? 


DMULU . L 


DT Rn*? 


EXTS .B Rm, Rn 


EXTS .W Rm, Rn 
EXTU.B Rm, Rn 
EXTU.W 


Rm, Rn 


JMP @Rm 


Instruction Set (cont) 


Instruction Code 
0010nnnnmmmm1100 


0010nnnnmmmm0111 


0000000000011001 
0011nnnnmmmm0100 


001innnnmmmm1101 


0011nnnnmmmm0101 


0100nnnn00010000 


0110nnnnmmmm1110 


0110nnnnmmmm1111 


0110nnnnmmmm1100 


0110nnnnmmmm1101 


0100mmmm00101011 


Operation 


If Rn and Rm have 
an equivalent byte, 
1oT 


MSB of Rn > Q, 
MSB of Rm > M, 
M4Q>T 


0 > M/Q/T 


Single-step division 
(Rn/Rm) 


Signed operation of 
Rn x Rm — MACH, 
MACL 


Unsigned operation 
of Rn x Rm > 
MACH, MACL 


Rn - 1 — Rn, when 
Rnis0,1—>T. 
When Rn is 
nonzero, 0 > T 


A byte in Rm is 
sign-extended —> 
Rn | 


A word in Rm is 
sign-extended — 
Rn 


A byte in Rm is 
zero-extended > 
Rn 


A word in Rm is 
zero-extended > 
Rn 


Delayed branch, 
Rm — PC 


Notes: 1. The normal minimum number of execution states 
2. SH-2 CPU instructions 


Execu- 


tion 
State 


1 


2 to 4*! 


2 to 4*! 


T Bit 


Comparison 
result 





Calculation 
result 


0 


Calculation 
result 


Comparison 
result 





Table 5.9 Instruction Set (cont) 
Execu- 
tion | ) 
Instruction Instruction Code Operation State T Bit 
JSR @Rm 0100mmmm00001011 Delayed branch, 2 — 
PC > PR, Rm > 
PC a 
LDC Rm,GBR 0100mmmm00011110 Rm > GBR 1 — 
LDC Rm, SR 0100mmmm00001110 Rm > SR 1 LSB 
LDC Rm, VBR 0100mmmm00101110 Rm — VBR 1 — 
LDC.L @Rm+,GBR 0100mmmm00010111 (Rm) GBR, Rm 3 — 
+4— Rm 
LDC.L @Rm+,SR 0100mmmm00000111 (Rm) > SR, Rm+ 3 LSB 
4— Rm 
LDC.L  @Rm+,VBR 0100mmmm00100111 (Rm) > VBR, Rm 3 — 
+4-—>Rm 
LDS Rm, MACH 0100mmmm00001010 Rm — MACH 1 — 
LDS Rm, MACL 0100mmmm00011010 Rm — MACL 1 — 
LDS Rm, PR 0100mmmm00101010 Rm — PR 1 — 
LDS.L  @Rmt+,MACH 0100mmmm00000110 (Rm) — MACH, 1 —_ 
Rm+4— Rm 
LDS.L @Rm+,MACL 0100mmmm00010110 (Rm) > MACL, 1 — 
| Rm+4— Rm 
LDS.L @Rm+,PR 0100mmmm00100110 (Rm) > PR, Rm+ 1 — 
4— Rm 
MAC.L @Rm+, @Rn+*? 0000nnnnmmmm1111 Signed operation of 3/(2to — 
(Rn) x(Rm) +MAC  4)*! 
— MAC 
MAC.W @Rm+, @Rn+ 0100nninnmmmm1111 Signed operation of 3/(2)*! — 
(Rn) x (Rm) + MAC | 
— MAC 
MOV #imm, Rn 1110nnnniiiiiiii imm — Sign 1 — 
extension — Rn 
MOV Rm, Rn 0110nnnnmmmm0011 Rm — Rn 1 — 
Notes: 1. The normal minimum number of execution states (the number in parentheses is the 


number of states when there is contention with preceding/following instructions) 
2. SH-2 instructions 
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Table 5.9 


Instruction 


MOV. 


MOV. 


MOV. 


MOV. 


MOV. 


MOV. 
MOV. 
MOV. 
MOV. 


MOV. 
MOV. 


MOV. 


MOV. 


B 


wy) wl] wo] w 


MOV. L 


MOV. 


MOV.L 
MOV.L 


MOV. 


MOV. L 
MOV.L 


MOV. 


MOV .W 


@(disp,GBR) , RO 


@(disp, Rm) , RO 
@(RO,Rm),Rn 


@Rm+, Rn 


@Rm, Rn 


RO, @(disp, GBR) 
RO, @(disp, Rn) 
Rm, @(RO, Rn) 
Rm, @—Rn 


Rm, @Rn 
@(disp,GBR) , RO 


@(disp,PC),Rn 
@(disp, Rm) ,Rn 


@(RO,Rm),Rn 
@Rm+, Rn 


@Rm, Rn 
RO, @(disp, GBR) 


Rm, @(disp, Rn) 


Rm, @(RO,Rn) 
Rm, @—Rn 


Rm, @Rn 
@(disp, GBR), RO 


Instruction Set (cont) 


Instruction Code 


11000100dddddddd 


10000100mmmmdddd 


0000nnnnmnmn1100 


0110nnnnmmmm0100 


0110nnnnmmmm0000 


11000000dddddddd 
10000000nnnndddd 
0000nnnnmmmm0100 
0010nnnnmmmm0100 


0010nnnnmmmm0000 
11000110dddddddd 


1101nnnndddddddd 


0101nnnnmmmmdddd 


0000nnnnmmmm1110 
0110nnnnmmmm0110 


0110nnnnmmmm0010 
11000010dddddddd 


0001innnnmmmmdddd 


0000nnnnmmmm0110 
0010nnnnmmmm0110 


0010nnnnmmmm0010 
11000101dddddddd 
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Operation 


(disp + GBR) > 
Sign extension > 
RO 


(disp + Rm) — Sign 
extension — RO 


(RO + Rm) — Sign 
extension — Rn 


(Rm) > Sign 
extension > Rn, 
Rm+1— Rm 


(Rm) > Sign 
extension — Rn 


RO — (disp + GBR) 
RO — (disp + Rn) 
Rm = (RO + Rn) 


Rn—-1 — Rn, Rm > 
(Rn) 


Rm — (Rn) 
(disp x 4 + GBR) > 
RO 


(disp x 4 + PC) > 
Rn 


(disp x 4 + Rm) > 
Rn 


(RO + Rm) — Rn 


(Rm) > Rn, Rm + 4 
— Rm 


(Rm) > Rn 


RO > (disp x4 + 
GBR) 


Rm -> (disp x 4 + 
Rn) 


Rm — (RO + Rn) 


Rn—4 — Rn, Rm > 
(Rn) 

Rm — (Rn) 

(disp x 2 + GBR) > 


Sign extension > 
RO 


Execu- 
tion 
State 

1 


1 


T Bit 


Table 5.9 


Instruction Set (cont) 


Notes: 1. The normal minimum number of execution states 
2. SH-2 CPU instructions 
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Execu- 
tion 
Instruction Instruction Code Operation State TBit 
MOV.W @(disp,PC),Rn 1001innnndddddddd (dispx2+PC)—> 1 — 
| Sign extension — 
Rn 
MOV.W @(disp,Rm),RO 10000101mmmmdddd (disp x2+Rm)-> 1 — 
Sign extension > 
RO 
MOV.W  @(RO,Rm),Rn 0000nnnnmmmm1101 (RO+Rm)— Sign 1 — 
extension > Rn 
MOV.W  @Rm+,Rn 0110nnnnmmmm0101 (Rm) - Sign 1 — 
extension — Rn, 
| Rm+2— Rm 
MOV.W  @Rm,Rn 0110nnnnmmmm0001 (Rm) > Sign 1 — 
extension — Rn 
MOV.W  RO,@(disp,GBR) 11000001dddddddd RO — (disp x 2+ 1 — 
| GBR) 
MOV.W RO, @(disp,Rn) 10000001nnnndddd RO — (disp x2 + 1 — 
: Rn) 
MOV.W = Rm, @(RO, Rn) 0000nnnnmmmm0101 Rm — (RO + Rn) 1 — 
MOV.W ~~ Rm, @—-Rn 0010nnnnmmmm0101 Rn-2-— Rn, Rm 1 — 
(Rn) 
MOV.W Rm, @Rn 0010nnnnmmmm0001 Rm — (Rn) 1 — 
MOVA @(disp,PC),RO 11000111dddddddd disp x4+PC—+RO 1 — 
MOVE ~—s_ Rn 0000nnnn00101001 T— Rn 4 — 
MUL.L  Rm,Rn*? 0000nnnnmmmm0111 RnxRm-—-MACL 2to4*! — 
MULS.W  Rm,Rn 0010nnnnmmmm1111 Signed operation of 1to3*! — 
| Rn x Rm — MAC 
MULU.W  Rm,Rn 0010nnnnmmmm1110 Unsigned operation 1to3*! — 
of Rn x Rm — MAC 
NEG Rm, Rn 0110nnnnmmmm1011 0O—Rm — Rn 1 — 
NEGC Rm, Rn 0110nnnnmmmm1010 0O—Rm-T — Rn, 1 Borrow 
| | Borrow > T 
NOP 0000000000001001 No operation 1 — 
NOT Rm, Rn 0110nnnnmmmm0111 ~~Rm— Rn 1 — 
OR #imm, RO 11001011111i11ii RO | imm — RO 1 — 
OR Rm, Rn 0010nnnnmmmm1011 Rn | Rm— Rn 1 — 


Table 5.9 


Instruction Set (cont) 


Instruction 


OR.B 


ROTCL 
ROTCR 
ROTL 
ROTR 
RTE 


RTS 


SETT 
SHAL 
SHAR 


SHLL2 
SHLL8 
SHLL16 
SHLR 
SHLR2 
SHLR8 
SHLR16 
SLEEP 
STC 
STC 
STC 
STC .L 


STC .L 


SCL 


STS 


#imm, @ (RO, GBR) 


8/8|8)8 


S(8|F/(S Fis isis sis 


GBR, Rn 
SR, Rn 
VBR, Rn 
GBR, @—Rn 


SR, @-Rn 


VBR, @—Rn 


MACH, Rn 


Instruction Code 


0100nnnn00100100 
0100nnnn00100101 
0100nnnn00000100 
0100nnnn00000101 
0000000000101011 


0000000000001011 


0000000000011000 
0100nnnn00100000 
0100nnnn00100001 
0100nnnn00000000 
0100nnnn00001000 
0100nnnn00011000 
0100nnnn00101000 
0100nnnn00000001 
0100nnnn00001001 
0100nnnn00011001 
0100nnnn00101001 
0000000000011011 
0000nnnn00010010 
0000nnnn00000010 
0000nnnn00100010 
0100nnnn00010011 


0100nnnn00000011 
0100nnnn00100011 


0000nnnn00001010 
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Operation 


(RO + GBR) | imm 
— (RO + GBR) 


T<e—Rn<eTt 
T—> Rn T 
T + Rn — MSB 
LSB — Rn 5 T 


Delayed branch, 
stack area > 
PC/SR 


Delayed branch, 
PR — PC 


139 T 
T<Rn<O 
MSB > Rn > T 
T<Rn<O 
Rn<<2 — Rn 
Rn<<8 — Rn 
Rn<<16 — Rn 
0—~Rn->T 
Rn>>2 — Rn 
Rn>>8 — Rn 
Rn>>16 — Rn 
Sleep 

GBR — Rn 
SR — Rn 
VBR —> Rn 


Rn—4 > Rn, GBR 
— (Rn) 


Rn—4 — Rn, SR > 
(Rn) 


Rn—4 — Rn, VBR 
— (Rn) 


MACH -—> Rn 


Execu- 
tion 
State 


oN ee 


Oe ee ee ee Se eS ee ee ee Se ee ee ee ee ee 


T Bit 


MSB 
LSB 
MSB 
LSB 
LSB 


MSB 
LSB 
MSB 





Table 5.9 


Instruction Set (cont) 


Instruction 

STS MACL, Rn 

STS PR,Rn 

STS.L MACH , @—Rn 
STS.L MACL, @—Rn 
STS.L PR, @—Rn 

SUB Rm, Rn 

SUBC Rm, Rn 

SUBV Rm, Rn 
SWAP.B  km,Rn 

SWAP .W Rm, Rn 

TAS .B @Rn 

TRAPA #imm 

TST #imm, RO 

TST Rm, Rn 

TST.B #imm, @(RO,GBR) 
XOR #imm, RO 

XOR Rm, Rn 

XOR.B #imm, @(RO,GBR) 


XTRCT 


Rm, Rn 


Instruction Code | 
0000nnnn00011010 
0000nnnn00101010 
0100nnnn00000010 


0100nnnn00010010 
0100nnnn00100010 


0011 nnnnmmmm1000 
0011nnnnmmmm1010 


0011nnnnmmmm1011 


0110nnnnmmmm1000 
0110nnnnmmmm1001 


0100nnnn00011011 


0010nnnnmmmm1101 


Ww 
0) 


Operation 
MACL — Rn 
PR — Rn 


Rn—4 — Rn, 
MACH -> (Rn) 


Rn—4 — Rn, 
MACL —> (Rn) 


Rn—4 —- Rn, PR > 
(Rn) 
Rn—Rm — Rn 


Rn—Rm—-T — Rn, 
Borrow +> T 


Rn—Rm — Rn, 
Underflow > T 


Rm — Swap upper 
and lower 2 
bytes— Rn 


Rm — Swap upper 
and lower word—> 
Rn 

If (Rn) is 0, 1 > T; 
1 — MSB of (Rn) 
PC/SR — stack 


area, (imm x4 + 
VBR) > PC 


RO & imm; if the 
result is 0, 1 > T 


Rn & Rm; if the 
result is 0, 1 > T 


(RO + GBR) & imm; 
if the result is 0, 1 
7 T 


RO “ imm > RO 
Rn “Rm -—- Rn 


(RO + GBR) * imm 
— (RO + GBR) 


Center 32 bits of 
Rm and Rn + Rn 


Execu- 
tion 
State 


1 


1 


© 


T Bit 


Borrow 


Under- 
flow 


Test 
result 


Test 
result 


Test 
result 


Test 
result 


Section 6 Instruction Descriptions 


This section describes instructions in alphabetical order using the format shown below in section 
6.1. The actual descriptions begin at section 6.2. 


6.1 Sample Description (Name): Classification 


Class: Indicates if the instruction is a delayed branch instruction or interrupt disabled instruction 


Format Abstract Code State T Bit 
Assembler input format; A brief description of | Displayed in Number of — The value of 
imm and disp are operation order MSB ° states when _‘T bit after the 
numbers, expressions, LSB there isno __ instruction is 
or symbols wait state executed 


Description: Description of operation 


Notes: Notes on using the instruction 


Operation: Operation written in C language. This part is just a reference to help understanding of 
an operation. The following resources should be used. 


Reads data of each length from address Addr. An address error will occur if word data is read 
from an address other than 2n or if longword data is read from an address other than 4n: 


unsigned char Read_Byte(unsigned long Addr) ; 
unsigned short Read_Word(unsigned long Addr) ; 
unsigned long Read_Long (unsigned long Addr) ; 


Writes data of each length to address Addr. An address error will occur if word data is written 
to an address other than 2n or if longword data is written to an address other than 4n: 


unsigned char Write_Byte(unsigned long Addr, unsigned long Data); 
unsigned short Write_Word(unsigned long Addr, unsigned long Data); 
unsigned long Write_Long(unsigned long Addr, unsigned long Data); 


Starts execution from the slot instruction located at an address (Addr — 4). For Delay_Slot (4);, 
execution starts from an instruction at address O rather than address 4. The following 
instructions are detected before execution as illegal slot instruction (they become illegal slot 
instructions when used as delay slot instructions): 


BF, BT, BRA, BSR, JMP, JSR, RTS, RTE, TRAPA, BF/S, BT/S, BRAF, BSRF 


Delay_Slot (unsigned long Addr) ; 
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e List registers: 


unsigned long R[16]; 


unsigned long SR,GBR, VBR; 


unsigned long MACH, MACL, PR; 


unsigned long PC; 


¢ Definition of SR structures: 


struct SRO { 


unsigned 
unsigned 
unsigned 
unsigned 
unsigned 
unsigned 
unsigned 


aa 


long 
long 
long 
long 
long 
long 
long 


dummy0 : 22; 


MO:1; 
00:1; 
10:4; 


dummy 1 : 2; 


S0O:1; 
TO:1; 


¢ Definition of bits in SR: 


#define M ((* (struct SRO 
#define Q ((* (struct SRO 
#define S ((* (struct SRO 


#define T ((* (struct SRO 


¢ Error display function: 


Error( char *er ); 


*) (&SR) ) .MO) 
*) (&SR)) .Q0) 
*) (&SR) ) .S0) 
*) (&SR)) .TO) 


The PC should point to the location four bytes (the second instruction) after the current instruction. 
Therefore, PC = 4; means the instruction starts execution from address 0, not address 4. 


Examples: Examples are written in assembler mnemonics and describe state before and after 
executing the instruction. Characters in italics such as .align are assembler control instructions 
(listed below). For more information, see the Cross Assembler User's Manual. 
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Org Location counter set 

.data.w Securing integer word data 
.data.1 Securing integer longword data 
.sdata Securing string data 

.align 2 2-byte boundary alignment 
.align 4 2-byte boundary alignment 


.arepeat 16  16-repeat expansion 


.arepeat 32 32-repeat expansion 


.aendar 


Note: 


Notes: 1. 


End of repeat expansion of specified number 


The SH-series cross assembler version 1.0 does not support the conditional assembler 
functions. 


In the assembler descriptions in this manual for addressing modes that involve the 
following displacements (disp), the value prior to scaling (x1, x2, x4) according to the 
operand size is written. This is done to show clearly the operation of the LSI; see the 
assembler notation rules for the actural assembler descriptions. 

@(disp:4, Rn): Register indirect with displacement 

@(disp:8, GBR): GBR indirect with displacement 

@(disp 8,PC): PC relative with displacement 

disp:8, disp:12: | PC relative 


. Among the 16 bits of the instruction code, a code not assigned as an instruction is 


treated as a general illegal instruction, and will result in illegal instruction exception 
processing, This includes the case where an instruction code for the SH-2 CPU only is 
executed on the SH-1 CPU. 
Example 1: H'FFF [General illegal instruction in both SH-1 and SH-2 CPU] 
Example 2: H'3105 (=DMUL.L RO, R1)[Tllegal instruction in SH-1 CPU] 


. If the instruction following a delayed branch instruction such as BRA, BTYS, etc., isa 


general illegal instruction or a branch instruction (known as a slot illegal instruction), 
illegal instruction exception processing will be performed. 


Example 1 .... 
BRA Label 
data. W H'FFFF  €¢ Slot illegal instruction 
[H'FFF is fundamentally a general illegal 
instruction] 


Example 2 RTE 
BT/S Label ¢€ Slot illegal instruction 
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6.2 ADD(ADD Binary): Arithmetic Instruction 


Format Abstract Code State T Bit 
ADD Rm, Rn Rm + Rn > Rn 0011nnnnmmmm1100 1 — 
ADD #imm, Rn Rn + imm — Rn O1llinnnniiiiiiii 1 — 


Description: Adds general register Rn data to Rm data, and stores the result in Rn. The contents 
of Rn can also be added to 8-bit immediate data. Since the 8-bit immediate data is sign-extended 
to 32 bits, this instruction can add and subtract immediate data. 


Operation: 


ADD(long m, long n) /* ADD Rm,Rn */ 
{ 

R[{n]+=R[m] ; 

PC+=2; 
} 


ADDI (long i,long n) /* ADD #1imm,Rn */ 

{ 
if ((i&0x80)==0) R[n]+=(0x000000FF & (long) i); 
else R[n]+=(OxFFFFFF00 | (long)i); 
PC+=2 ; 

} 


Examples: 


ADD RO,R1 Before execution RO=H"7FFFFFFF, R1 = H'00000001 
After execution R1 = H'80000000 


ADD #H'01,R2 Before execution R2=H'00000000 
After execution R2 = H'00000001 


ADD #H'FE,R3 Before execution R3=H'00000001 
After execution R3 = H'FFFFFFFF 
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6.3  ADDC (ADD with Carry): Arithmetic Instruction 
Format Abstract Code State T Bit 


ADDC Rm,Rn Rn+Rm+T-—Rn, carry T 0011nnnnmmmm1110 1 Carry 





Description: Adds general register Rm data and the T bit to Rn data, and stores the result in Rn. 
The T bit changes according to the result. This instruction can add data that has more than 32 bits. 


Operation: 


ADDC (long m,long n) /* ADDC Rm,Rn */ 
{ 
unsigned long tmp0,tmp1; 


tmp1=R[n]+R[m] ; 
tmp0=R[n] ; 
R[{n]=tmp1+T; 

if (tmp0>tmp1) T=1; 


else T=0; 
if (tmpl>R[n]) T=1; 
PC+=2 ; 
} 
Examples: 
CLRT RO:R1 (64 bits) + R2:R3 (64 bits) = RO:R1 (64 bits) 
ADDC  R3, RL Before execution T = 0, R1 = H'00000001, R3 = H'FFFFFFFF 
After execution T = 1, RI = H'0000000 
ADDC R2,R0 Before execution T = 1, RO = H'00000000, R2 = H'00000000 
After execution T = 0, RO = H'00000001 


41 


6.4 ADDV (ADD with V Flag Overflow Check): Arithmetic Instruction _ 


Format. Abstract | i Code ? bas State T Bit 
ADDV Rm, Rn. —Rn+Rm—-> Rn, overflow > T 0011nnnnmmmm1111 1 Overflow © 


Description: Adds general register Rn data to Rm data, and stores the result in Rn. If an overflow 
occurs, the T bit is set to 1. 


Operation: 


ADDV (long m, long n) /*ADDV Rm,Rn */ 
{ 


long dest,src,ans; 


if ((long)R[n]>=0) dest=0; 
else dest=1; 

if ((long)R[m]>=0) src=0; 
else src=1; 

src+=dest; 

R[n]+=R[m] ; 

if ((long)R[n]>=0) ans=0; 


else ans=1; 


ans+=dest; 

if (src==0 || src==2) { 
if (ans==1) T=1; 
else T=0; 

} 

else T=0; 

PC+=2; 

} 
Examples: 


ADDV ~~ RO,R1 Before execution RO = H'00000001, R1 = H’7FFFFFFE, T = 0 
After execution R1 = H'7FFFFFFF, T = 0 


ADDV RO, R1 Before execution RO = H'00000002, R1 = H'7FFFFFFE, T = 0 
After execution R1 = H'80000000, T = 1 
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6.5 AND (AND Logical): Logic Operation Instruction 


Format Abstract Code State T Bit 

AND ~~ Rm,Rn Rn & Rm > Rn 0010nnnnmmmm1001 1 — 

AND #imm, RO RO & imm — RO 11001001i1i111111i 1 —_ 

AND.B #imm,@(RO,GBR) (RO+ GBR) &imm— (RO+ 11001101iiiiiiii 3 — 
GBR) | : 





Description: Logically ANDs the contents of general registers Rn and Rm, and stores the result in 
Rn. The contents of general register RO can be ANDed with zero-extended 8-bit immediate data. 
8-bit memory data pointed to by GBR relative addressing can be ANDed with 8-bit immediate 
data. 


Note: After AND #imm, RO is executed and the upper 24 bits of RO are always cleared to 0. 
Operation: 


AND (long m,long n) /* AND Rm,Rn */ 


{ 
R{n]&=R[m] 
PC+=2; 

} 

ANDI (long i) /* AND #imm,RO */ 

{ 
R[(0]&=(O0x000000FF & (long)i); 
PC+=2; 

} 

ANDM(long i) /* AND.B #imm,@(RO,GBR) */ 

{ 
long temp; 
temp= (long) Read_Byte(GBR+R[0]); 
temp&=(OxQ00000FF & (long)i); 
Write_Byte(GBR+R[0], temp) ; 
PC+=2; 

} 
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Examples: 


AND —-RO, R1 


AND #H' OF, RO 


AND.B #H'80,@(RO,GBR) 


Before execution 


After execution 


Before execution 


After execution 


Before execution 


After execution 


RO = H'AAAAAAAA, RI = H'55555555 
R1 = H'00000000 

RO= H'FFFFFFFF 

RO = H'0000000F 

@(RO,GBR) = H'AS 

@(RO,GBR) = H'80 


6.6 BF (Branch if False): Branch Instruction 





Format Abstract Code State T Bit 
BF label WhenT =0O, disp x2+PC — PC; 10001011dddddddd 3/1 — 
When T = 1, nop 


Description: Reads the T bit, and conditionally branches. If T = 1, BF executes the next 
instruction. If T = 0, it branches. The branch destination is an address specified by PC + 
displacement. The PC points to the starting address of the second instruction after the branch 
instruction. The 8-bit displacement is sign-extended and doubled. Consequently, the relative 
interval from the branch destination is —256 to +254 bytes. If the displacement is too short to reach 
the branch destination, use BF with the BRA instruction or the like. 


Note: When branching, three cycles; when not branching, one cycle. 
Operation: 


BF (long da) /* BF disp */ 
{ 
long disp; 


1£ ((d&0x80)==0) disp=(0x000000FF & (long)d); 
else disp=(0xFFFFFF00 | (long)d); | 
if (T==0) PC=PC+ (disp<<1) +4; 
else PC+=2; 
} 


Example: 


CLRT T is always cleared to 0 
BT TRGET_T Does not branch, because T = 0 
BF TRGET_F Branches to TRGET_F, because T = 0 


NOP 

NOP < The PC location is used to calculate the 
branch destination address of the BF 
instruction 


TRGET _F: < Branch destination of the BF instruction 
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6.7 BE/S (Beanch if False with Delay Slot): Branch Instruction (SH-2 
CPU) 


Class: Delayed branch instruction 


Format Abstract | | | Code | $tate TBit 


BF/S label WhenT =0, disp x 2 + PC > PC; 10001111dddddddd 2/1 — 
When T = 1, ue | SO 3 


Description: Reads the T bit, and conditionally branches with delay slot. If T = 1, BF executes the 
next instruction. If T = 0, it branches after executing the next instruction. The branch destination is 
an address specified by PC + displacement. The PC points to the starting address of the second 
instruction after the branch instruction. The 8-bit displacement is sign-extended and doubled. 
Consequently, the relative interval from the branch destination is —256 to +254 bytes. If the 
displacement is too short to reach the branch destination, use BF/S with the BRA instruction or the 
like. 


Note: Since this is a delayed branch instruction, the instruction immediately after is executed 
before the branch. Between the time this instruction and the instruction immediately after are 
executed, address errors or interrupts are not accepted. When the instruction immediately after is a 
branch instruction, it is recognized as an illegal slot instruction. 


When branching, this is a two-cycle instruction; when not branching, one cycle. 
Operation: 


BFS(long d) /* BFS disp */ 
{ 
long disp; 


unsigned long temp; 


temp=PC; 
if ((d&0x80)==0) disp=(0x000000FF & (long)d); 
else disp=(OxFFFFFFO0O | (long)d); 
ne ; = [ | . . 
PC=PC+ (disp<<1)+4; 
Delay_Slot(tempt2) ; 
| 
else PCt+=2; 
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Example: 


CLRT Tis always 0 
BT/S TRGET_T Does not branch, because T = 0 


NOP 

BF/S TRGET_F Branches to TRGET, because T = 0 

ADD RO,R1 Executed before branch 

NOP < The PC location is used to calculate the branch destination 
address of the BF/S instruction 


TRGET F: < Branch destination of the BF/S instruction 


Note: With delayed branching, branching occurs after execution of the slot instruction. 
However, instructions such as register changes etc. are executed in the order of delayed 
branch instruction, then delay slot instruction. For example, even if the register in which the 
branch destination address has been loaded is changed by the delay slot instruction, the 
branch will still be made using the value of the register prior to the change as the branch 
destination address. 
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6.8 BRA (Branch): Branch Instruction 


Class: Delayed branch instruction 


Format Abstract Code State TBit 





BRA label disp x2 +PC > PC 1010dddddddddddd 2 = 





Description: Branches unconditionally after executing the instruction following this BRA 
instruction. The branch destination is an address specified by PC + displacement. The PC points to 
the starting address of the second instruction after this BRA instruction. The 12-bit displacement is 
sign-extended and doubled. Consequently, the relative interval from the branch destination is — 
4096 to +4094 bytes. If the displacement is too short to reach the branch destination, this 
instruction must be changed to the JMP instruction. Here, a MOV instruction must be used to 
transfer the destination address to a register. 


Note: Since this is a delayed branch instruction, the instruction after BRA is executed before 
branching. No interrupts or address errors are accepted between this instruction and the next 
instruction. If the next instruction is a branch instruction, it is acknowledged as an illegal slot . 
instruction. 


Operation: 


BRA(long d) /* BRA disp */ 
{ 


unsigned long temp; 


long disp; 
if ((d&0x800)==0) disp=(OxO0000FFF & qd); 
else disp=(OxFFFFFOOO | d); 
temp=PC; 
PC=PC+ (disp<<1)+4; 
Delay_Slot(tempt2) ; 
} 


Example: 


BRA TRGET Branches to TRGET 
ADD RO,R1 Executes ADD before branching 


NOP < The PC location is used to calculate the branch destination 
address of the BRA instruction 
TRGET: < Branch destination of the BRA instruction 
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Note: With delayed branching, branching occurs after execution of the slot instruction. 
However, instructions such as register changes etc. are executed in the order of delayed 
branch instruction, then delay slot instruction. For example, even if the register in which the 
branch destination address has been loaded is changed by the delay slot instruction, the 
branch will still be made using the value of the register prior to the change as the branch 
destination address. 
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6.9  BRAF (Branch Far): Branch Instruction (SH-2 CPU) 
Class: Delayed branch instruction — 


Format Abstract _ Code State TBit 
BRAF Rm Rm + PC + PC 0000mmmm00100011 2 0 — 





Description: Branches unconditionally. The branch destination is PC + the 32-bit contents of the 
general register Rm. PC is the start address of the second instruction after this instruction. 


Note: Since this is a delayed branch instruction, the instruction after BRAF is executed before 
branching. No interrupts or address errors are accepted between this instruction and the next 
instruction. If the next instruction is a branch instruction, it is acknowledged as an illegal slot 
instruction. 


Operation: 


BRAF(long m) /* BRAF Rm */ 
{ 


unsigned long temp; 


temp=PC; 

PC+=R[m] ; 

Delay_Slot(tempt2) ; 
} 


Example: 


MOV.L #(TRGET-BSRF_PC),RO Sets displacement 


BRAF GRO Branches to TRGET 
ADD RO,R1 Executes ADD before branching 
BRAF PC: < The PC location is used to calculate 
the branch destination address of 
the BRAF instruction 
NOP 
TRGET: <— Branch destination of the BRAF instruction 
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Note: With delayed branching, branching occurs after execution of the slot instruction. 
However, instructions such as register changes etc. are executed in the order of delayed 
branch instruction, then delay slot instruction. For example, even if the register in which the 
branch destination address has been loaded is changed by the delay slot instruction, the 
branch will still be made using the value of the register prior to the change as the branch 
destination address. 
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6.10 BSR (Branch to Subroutine): Branch Instruction 


Class: Delayed branch instruction 


Format —s_—«(w® Abstract ' Code State TBit 
BSR _— label PC — PR, dispx2+PC—PC 1011dadddddddada 2 = 


Description: Branches to the subroutine procedure at a specified address after executing the 
instruction following this BSR instruction. The PC value is stored in the PR, and the program 
branches to an address specified by PC + displacement. The PC points to the starting address of 
the second instruction after this BSR instruction. The 12-bit displacement is sign-extended and 
doubled. Consequently, the relative interval from the branch destination is —4096 to +4094 bytes. 
If the displacement is too short to reach the branch destination, the JSR instruction must be used 
instead. With JSR, the destination address must be transferred to a register by using the MOV 
instruction. This BSR instruction and the RTS instruction are used for a subroutine procedure call. 


Note: Since this is a delayed branch instruction, the instruction after BSR is executed before 
branching. No interrupts or address errors are accepted between this instruction and the next 
instruction. If the next instruction is a branch instruction, it is acknowledged as an illegal slot 
instruction. 


Operation: 


BSR(long da) /* BSR disp */ 
if 
long disp; 


if ((d&0x800)==0) disp=(Ox00000FFF & d); 
else disp=(OxFFFFFOOO | d); 

PR=PC; 

PC=PC+ (disp<<1)+4; 

Delay_Slot(PR+t2) ; 
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Example: 


BSR TRGET Branches to TRGET 
MOV R3,R4 Executes the MOV instruction before branching 
ADD  RO,R1 <- The PC location is used to calculate the branch destination 


address of the BSR instruction (return address for when the 
subroutine procedure is completed (PR data)) 


@eoee e @ 6 


TRGET : <— Procedure entrance 
MOV R2,R3 
RTS Returns to the above ADD instruction 
MOV #1, RO Executes MOV before branching 


Note: With delayed branching, branching occurs after execution of the slot instruction. 
However, instructions such as register changes etc. are executed in the order of delayed 
branch instruction, then delay slot instruction. For example, even if the register in which the 
branch destination address has been loaded is changed by the delay slot instruction, the 
branch will still be made using the value of the register prior to the change as the branch 
destination address. 
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6.11 BSRF (Branch to Subroutine Far): Branch Instruction (SH-2 CPU) 


Class: Delayed branch instruction 


Format | Abstract Code State T Bit 





BSRF Rn PC + PR,Rm+PC—+PC | 0000mmmm00000011 2 — 


Description: Branches to the subroutine procedure at a specified address after executing the 
instruction following this BSRF instruction. The PC value is stored in the PR. The branch 
destination is PC + the 32-bit contents of the general register Rm. PC is the start address of the 
second instruction after this instruction. Used as a subroutine procedure call in combination with 
RTS. 


Note: Since this is a delayed branch instruction, the instruction after BSR is executed before 

branching. No interrupts or address errors are accepted between this instruction and the next 

instruction. If the next instruction is a branch instruction, it is acknowledged as an illegal slot 
instruction. 


Operation: 
BSRF(long m) /* BSRF Rm */ 
{ 


PR=PC; 
PC+=R[m] ; 
Delay_Slot(PR+2) ; 
} 
Example: 
MOV.L #(TRGET-BSRF_PC),RO Sets displacement 
BRSF @RO Branches to TRGET 
MOV -R3, R4 Executes the MOV instruction before 
branching 
BSRF_PC: < The PC location is used to 
calculate the branch destination 
with BSRF 
ADD RO,R1 
TRGET: <- Procedure entrance 
MOV R2,R3 
RTS Returns to the above ADD instruction 
MOV = #1, RO Executes MOV before branching 


Note: With delayed branching, branching occurs after execution of the slot instruction. 
However, instructions such as register changes etc. are executed in the order of delayed branch 
instruction, then delay slot instruction. For example, even if the register in which the branch 
destination address has been loaded is changed by the delay slot instruction, the branch will 
still be made using the value of the register prior to the change as the branch destination 
address. 
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6.12 BT (Branch if True): Branch Instruction 


Format Abstract Code State T Bit 
BT label When T = 1, disp x 2+ PC > 10001001dddddddd 3/1 — 
PC; | 
When T = 0, nop 


Description: Reads the T bit, and conditionally branches. If T = 1, BT branches. If T = 0, BT 
executes the next instruction. The branch destination is.an address specified by PC + 
displacement. The PC points to the starting address of the second instruction after the branch 
instruction. The 8-bit displacement is sign-extended and doubled. Consequently, the relative 
interval from the branch destination is —-256 to +254 bytes. If the displacement is too short to reach 
the branch destination, use BT with the BRA instruction or the like. 


Note: When branching, requires three cycles; when not branching, one cycle. 
Operation: 


BT(long d) /* BT disp */ 
{ 
long disp; 


if ((d&0x80)==0) disp=(Ox0O0000FF & (long)d); 
else disp=(OxFFFFFF00O | (long)d); 
if (T==1) PC=PC+(disp<<1)+4; 


else PCt=2; 
} 
Example: 
SETT T is always 1 
‘BF TRGET_F Does not branch, because T = 1 
BT TRGET_T Branches to TRGET_T, because T = 1 
NOP 
NOP < The PC location is used to calculate the branch destination 
address of the BT instruction 
TRGET_T: < Branch destination of the BT instruction 


55 


6.13. BT/S (Branch if True with Delay Slot): Branch Instruction (SH-2 | 


CPU) | 
Format Abstract Code State ‘TT Bit 
BT/S label | WhenT=1,dispx2+PC-»  10001101dddddddd 21 — 
PC; 
When T = 0, nop 


Description: Reads the T bit, and conditionally branches with delay slot. If T = 1, BT/S branches 
after the following instruction executes. If T = 0, BT/S executes the next instruction. The branch 
destination is an address specified by PC + displacement. The PC points to the starting address of 
the second instruction after the branch instruction. The 8-bit displacement is sign-extended and 
doubled. Consequently, the relative interval from the branch destination 1s —256 to +254 bytes. If 
the displacement is too short to reach the branch destination, use BT/S with the BRA instruction or 
the like. 


Note: Since this is a delay branch instruction, the instruction immediately after is executed before 
the branch. Between the time this instruction and the immediately after instruction are executed, 
address errors or interrupts are not accepted. When the immediately after instruction is a branch 
instruction, it is recognized as an illegal slot instruction. When branching, requires two cycles; 
when not branching, one cycle. 3 


Operation: 


BTS(long d) /* BTS disp */ 
{ 
long disp; 


unsigned long temp; 


temp=PC ; 
if ((d&0x80)==0) disp=(OxOO0000FF & (long)d); 
else disp=(0OxFFFFFF0O0 | (long)d); 
if (T==1) { 
PC=PC+(disp<<1)+4; 
Delay_Slot(temp+2) ; 
} 
else PCt=2; 
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Example: 


SETT T is always 1 
BF/S TRGET_F Does not branch, because T = 1 
NOP 


BT/S TRGET_T Branches to TRGET, because T = 1 
ADD RO,R1 Executes before branching. 


NOP « The PC location is used to calculate the branch destination 
address of the BT/S instruction 
TRGET_T: < Branch destination of the BT/S instruction 


Note: With delayed branching, branching occurs after execution of the slot instruction. 
However, instructions such as register changes etc. are executed in the order of delayed 
branch instruction, then delay slot instruction. For example, even if the register in which the 
branch destination address has been loaded is changed by the delay slot instruction, the 
branch will still be made using the value of the register prior to the change as the branch 
destination address. 
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6.14 CLRMAC (Clear MAC Register): System Control Instruction 


Format Abstract Code | State T Bit 
CLRMAC O-»MACH,MACL  —T 0000000000101000 1 ree 


Description: Clears the MACH and MACL registers. 
Operation: 


CLRMAC() /* CLRMAC */ 
{ 
MACH=0; 
MACL=0; 
PC+=2; 
} 


Example: 


CLRMAC Initializes the MAC register 
MAC .W @RO+, @R1+ Multiply and accumulate operation 
MAC .W @RO+, @R1+ 
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6.15 CLRT (Clear T Bit): System Control Instruction 
Format Abstract Code State 
CLRT O->T 0000000000001000 1 


Description: Clears the T bit. 
Operation: 


CLRT() /* CLRT */ 
{ 


PC+=2; 
} 
Example: 
CLRT Before execution T=1 
| After execution T=0 


59 


T Bit 


6.16 CMP/cond (Compare Conditionally): Arithmetic Instruction. 





Format Abstract Code State TBit 
CMP/EQ Rkm,Rn $WhenRn=Rm,1—-T 0O11innnnmmmm0000 ~—Ss 1 Comparison 
oe : | - 2 42? . result 
CMP/GE Rm,Rn When signed and Rn 2 0011nnnnmmmm0011 1 Comparison 
Rm, 1 > T result — 
CMP/GT kRm,Rn When signed and Rn > 0011innnnmmmm0111 1 Comparison 
Rm, 13 T result 
CMP/HI kRm,Rn When unsigned and Rn> 0011nnnnmmmm0110 1 _ Comparison 
Rm, 1 T result 
CMP/HS kRm,Rn When unsigned and Rn= = 001 1nnnnmmmm0010 1 Comparison 
Rm, 1—>T result. 
CMP/PL Rn When Rn>0, 17> T 0100nnnn00010101 1 Comparison 
result 
CMP/PZ Rn When Rn 20, 1 > T 0100nnnn00010001 1 Comparison 
| result 
CMP/STR Rm,Rn When. a byte in Rn equals 0010nnnnmmmm1100 1 Comparison 
_ abytein Rm, 17 T oo result 
CMP/EQ #imm,RO When RO=imm,1—>T 10001000iiiiiiii 1 Comparison 
result 





Description: Compares general register Rn data with Rm data, and sets the T bit to 1 if a specified 
condition (cond) is satisfied. The T bit is cleared to 0 if the condition is not satisfied. The Rn data 
does not change. The following eight conditions can be specified. Conditions PZ and PL are the 
results of comparisons between Rn and 0. Sign-extended 8-bit immediate data can also be 
compared with RO by using condition EQ. Here, RO data does not change. Table 6.1 shows the 
mnemonics for the conditions. 


60 


Table 6.1 CMP Mnemonics 

Mnemonics Condition 

CMP/EQ Rm, Rn If Rn = Rm, T =1 

CMP/GE Rm, Rn If Rn = Rm with signed data, T = 1 
CMP/GT Rm, Rn If Rn > Rm with signed data, T = 1 
CMP/HI Rm, Rn If Rn > Rm with unsigned data, T = 1 
CMP/HS Rm, Rn If Rn > Rm with unsigned data, T = 1 
CMP/PL Rn If Rn >O, T = 1 

CMP/PZ Rn lf Rn >0, T= 1 


CMP/STR Rm, Rn 
CMP/EQ #imm,RO 


Operation: 


CMPEQ(long m,long n) 


{ 


If a byte in Rn equals a byte in Rm, T = 1 
lf RO = imm, T = 1 


/* CMP_EQ Rm,Rn */ 


if (R[n]==R[{m}) T=1; 


else T=0; 


PC+=2; 


} 


CMPGE(long m,long n) 


{ 


/* CMP_GE Rm,Rn */ 


if ((long)R[n]>=(long)R[m]) T=1; 


else T=0; 


PC+=2; 


} 


CMPGT(long m,long n) 


{ 


/* CMP_GT Rm,Rn */ 


if ((long)R[n]>(long)R[m]) T=1; 


else T=0; 


PCt+=2; 
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CMPHI (long m,long n) /* CMP_HI Rm,Rn */ 
{ 
if ((unsigned long)R[n]>(unsigned long)R{[m] ) T=1; 


else T=0; 
PC+=2; 

, 

CMPHS(long m,long n) /* CMP_HS Rm,Rn */ 
if ((unsigned long)R[n]>=(unsigned long)R[m]) T=1; 
else T=0; 

PCt+=2; 

} 

CMPPL(long n) /* CMP_PL Rn */ 

{ 
if ((long)R[n]>0) T=1; 
else T=0; 

PC+=2; 

} 


CMPPZ(long n) /* CMP_PZ Rn */ 
{ 
if ((long)R[n]>=0) T=1; 
else -T=0; 


PC+=2; 


62 


CMPSTR(long m,long n) /* CMP_STR Rm,Rn */ 
{ 
unsigned long temp; 


long HH,HL,LH, LL; 


temp=R[n]*R[m] ; 
HH=(temp>>12 ) &OxOOOOOOFF; 
HH=(temp>>8 ) SOXOOOOOOFF; 
HH=(temp>>4 ) &OxO00000FF; 
LL=temp&OxOOOQOOO0FF; 
HH=HH&&HL&&LH&&LL; 

if (HH==0) T=1; 


else T=0; 
PCt+=2; 

} 

CMPIM(long i) /* CMP_EQ #imm,RO */ 

{ 
long imm; 
if ((i&0x80)==0) imm=(0Ox000000FF & (long i)); 
else imm=(OxFFFFFFOO | (long i)); 
if (R[O]==imm) T=1; 
else T=0; 
PC+=2; 

} 

Example: 

CMP/GE RO,R1 RO = H'7FFFFFFF, R1 = H'80000000 
BT TRGET_T Does not branch because T = 0 
CMP/HS RO,R1 RO = H'7FFFFFFF, R1 = H'80000000 
BT TRGET_T Branches because T = 1 
CMP/STR R2,R3 R2 = “ABCD”, R3 = “X YCZ” 
BT TRGET_T Branches because T = 1 
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6.17 DIVOS (Divide Step 0 as Signed): Arithmetic Instruction 


Format Abstract Code State T Bit 
DIVOS Rm,Rn MSBofRn—-Q,MSBofRm— 0010nnnnmmmm0111 A: Calculation 
M, M4Q > T result 





Description: DIVOS is an initialization instruction for signed division. It finds the quotient by 
repeatedly dividing in combination with the DIV1 or another instruction that divides for each bit 
after this instruction. See the description given with DIV1 for more information. 


Operation: 


DIVOS(long m,long n) 7* DIVOS Rm,Rn */ 
{ 

if ((R[n]&0x80000000)==0) Q=0; 

else Q=1; 

if ((R[m] &0x80000000)==0) M=0; 

else M=1; 

T=! (M==Q); 

PC+=2; 
} 


Example: See DIV1. 


6.18 DIVOU (Divide Step 0 as Unsigned): Arithmetic Instruction 


Format Abstract Code , State T Bit 
DIVOU 0 > MQ/T. 0000000000011001 1 0 


Description: DIVOU is an initialization instruction for unsigned division. It finds the quotient by 
repeatedly dividing in combination with the DIV1 or another instruction that divides for each bit 
after this instruction. See the description given with DIV1 for more information. 


Operation: 


DIVOU() /* DIVOU */ 
{ 


M=Q=T=0; 
PC+=2; 
} 
Example: See DIV1. 
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6.19 DIV1 (Divide Step 1): Arithmetic Instruction — 





Format | Abstract oS 8 Code -. - State T Bit. 
DIV1 Rm,Rn 1-step division (Rn + Rm) 0011nnnnmmmm0100 1 Calculation 
. —: | a a _ ‘result 


Description: Uses single-step division to divide one bit of the 32-bit data in general register Rn — 
(dividend) by Rm data (divisor). It finds a quotient through repetition either independently or used 
in combination with other instructions. During this repetition, do not rewrite the specified een 
or the M, Q, and T bits. ; : 


In one-step division, the dividend is shifted one bit left, the divisor is subtracted and the quotient 

bit reflected in the Q bit according to the status (positive or negative). To find the remainder in a 

division, first find the quotient using a DIV1 instruction, then find the remainder as follows: 
(Dividend) — (divisor) x (quotient) = (remainder) 

with the SH-2 CPU in which a divider is installed as a peripheral function, the remainder can be 

found as a function of the divider. 


Zero division, overflow detection, and remainder operation are not supported. Check for zero) 
division and overflow division before dividing. | 


Find the remainder by first finding the sum of the divisor and the quotient obtained and then 
subtracting it from the dividend. That is, first initialize with DIVOS or DIVOU. Repeat DIV1 for 
each bit of the divisor to obtain the quotient. When the quotient requires 17 or more bits, place 
ROTCL before DIV1. For the division sequence, see the following examples. 
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Operation: 


DIV1(long m,long n)- /* DIV1 Rm,Rn */ 
{ 
unsigned long tmp0; 
unsigned char old_q,tmp1; 
old_q=Q; 
Q=(unsigned char) ((0x80000000 & R[n])!=0); 
R({n]<<=1; 


R[{n] |=(unsigned long)T; 
switch(old_q) { 
case 0:switch(M){ 
case 0:tmp0=R[n]; 
R[n]-=R[m] ; 
tmp1=(R[n]>tmpo) ; 
switch (Q) { 
case 0:Q=tmpl1; 
break; 
case 1:Q=(unsigned char) (tmpl==0) ; 
break; 
} 
break; 
case 1:tmp0=R[n]; 
R[n)+=R[m]; 
tmp1=(R[n]<tmp0) ; 
switch(Q){ 
case 0:Q=(unsigned char) (tmp1==0) ; 
break; 
case 1:Q=tmpl1; 
| break; 
} 
break; 
} 


break; 
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case 1:switch(M) { 


} 


case 0:tmp0=R[n]; 
R[n]+=R[m] ; 
tmp1=(R[n]<tmp0) ; 
switch(Q) { 
case 0:Q=tmp1; 
break; 
case 1:Q=(unsigned char) (tmp1==0) ; 
break; 
} 
break; 
case 1:tmp0=R[n]; 
R[n]-=R[m] ; 
tmp1=(R[n]>tmp0) ; 
Switch(Q) { 
case 0:Q=(unsigned char) (tmp1==0) ; 
break; 
case 1:Q=tmpl1; 
break; 
} 
break; 


} 


break; 


T=(Q==M) ; 
PC+=2; 
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Example 1: 


SHLL16 
TST 

BT 
CMP/HS 
BT 
DIVOU 
.arepeat 
DIV1 
.aendr 
ROTCL 
EXTU .W 


Example 2: 


TST 

BT 
CMP/HS 
BT 

DIVOU 
.arepeat 
ROTCL 

DIV1 
.aendr 


ROTCL 


RO 

RO, RO 
ZERO_DIV 
RO,R1 
OVER_DIV 


16 
RO,R1 


R1,R2 


RO, RO 
ZERO_DIV 
RO,R1 
OVER_DIV 


32 
R2 
RO,R1 


R1 (32 bits) / RO (16 bits) = R1 (16 bits): Unsigned 
Upper 16 bits = divisor, lower 16 bits = 0 

Zero division check 

Overflow check 


Flag initialization 


Repeat 16 times 


R1 = Quotient 


R1:R2 (64 bits)/RO (32 bits) = R2 (32 bits): Unsigned 


Zero division check 


Overflow check 


Flag initialization 


Repeat 32 times 


R2 = Quotient 
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Example 3: 


R1 (16 bits)/RO (16 bits) = R1 (16 bits):Signed 


SHLL16 RO Upper 16 bits = divisor, lower 16 bits = 0 

EXTS.W R1,R1 Sign-extends the dividend to 32 bits 

XOR R2,R2 R2=0 

MOV R1,R3 

ROTCL R3 

SUBC R2,R1 Decrements if the dividend is negative 

DIVOS RO,R1 Flag initialization 

.arepeat 16 

DIV1 RO,R1 Repeat 16 times 

.aendr 

EXTS .W R1,R1 

ROTCL R1 R1 = quotient (one’s complement) 

ADDC R2,R1 Increments and takes the two’s complement if the MSB of the 

» quotient is 1 

EXTS.W R1,R1 R1 = quotient (two’s complement) 

Example 4: 


R2 (32 bits) / RO (32 bits) = R2 (32 bits):Signed 


MOV R2,R3 
ROTCL R3 
SUBC R1,R1 Sign-extends the dividend to 64 bits (R1:R2) 
XOR R3,R3 R3 =0 
SUBC R3,R2 Decrements and takes the one’s complement if the dividend is 
negative 
DIVOS RO,R1 Flag initialization 
.arepeat 32 
ROTCL R2 Repeat 32 times 
DIV1 RO,R1 
,aendr 
ROTCL R2 R2 = Quotient (one’s complement) 
ADDC R3,R2 Increments and takes the two’s complement if the MSB of the 


quotient is 1. R2 = Quotient (two’s complement) 
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6.20 DMULS.L (Double-Length Multiply as Signed): Arithmetic 
Instruction (SH-2 CPU) 


Format Abstract Code State T Bit 
DMULS.L Rm,Rn  £Withsigned, Rn xRm—- 0011nnnnmmmm1101 2to4 — 
MACH, MACL | 


Description: Performs 32-bit multiplication of the contents of general registers Rn and Rm, and 
stores the 64-bit results in the MACL and MACH registers. The operation is a signed arithmetic 
operation. 


Operation: 


DMULS(long m,long n) /* DMULS.L Rm,Rn */ 

{ 
unsigned long RnL,RnH, RmL, RmH, ResO,Resl1,Res2; 
unsigned long temp0,temp1,temp2,temp3; 
long tempm,tempn, fnLmL; 


tempn=(long)R[n]; 
tempm=(long)R[m] ; 

if (tempn<0) tempn=0-tempn; 

if (tempm<0) tempm=0-tempm; 

if ((long)(R[n]*R[m])<0) fnLmL=-1; 
else fnLmL=0; 


templ=(unsigned long)tempn; 


temp2=(unsigned long)tempm; 


RnL=temp1&0x0000FFFF; 
RnH=(temp1>>16 ) &0xO0000FFFF; 
RmL=temp2&0x0000FFFF ; 
RmH=(temp2>>16) &0x0000FFFF; 


temp0O=RmL*RnL; 
temp1=RmH*RnL; 
temp2=RmL*RnH ; 
temp3=RmH*RnH ; 


v1 


Res2=0 
Res1=temp1+temp2 ; 
if (Resl<temp1) Res2+=0x00010000; 


temp1=(Res1<<16 ) &OxFFFF0000; 
ResO=temp0+temp]l ; 
if (ResO<temp0) Res2++; 


Res2=Res2+( (Res1>>16) &SOxOOOOFFFF ) +temp3; 


if (fnLmL<0) { 
Res 2=~Res2; 
if (Res0==0) 
Res2++; 
else 
Res0=(~Res0)+1; 
} 
MACH=ReS2; 
MACL=Res0; 
PC+=2; 
} 


Example: 


DMULS RO,R1 Before execution RO = H'FFFFFFFE, R1 = H'00005555 

After execution MACH = H'FFFFFFFF, MACL = H'FFFF5556 
STS MACH, RO Operation result (top) 
STS MACL, RO Operation result (bottom) 
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6.21 DMULU.L (Double-Length Multiply as Unsigned): Arithmetic 
Instruction (SH-2 CPU) 


Format Abstract Code State T Bit 
DMULU.L  Rm,Rn Without signed, Rn x Rm —-> 0011nnnnmmmm0101 2to4 — 
MACH, MACL 


Description: Performs 32-bit multiplication of the contents of general registers Rn and Rm, and 
stores the 64-bit results in the MACL and MACH registers. The operation is an unsigned 
arithmetic operation. 


Operation: 


DMULU(long m,long n) /* DMULU.L Rm,Rn */ 

{ 
unsigned long RnL,RnH,RmL, RmH, ResOQ,Resl1,Res2; — 
unsigned long temp0,templ1,temp2,temp3; 


RnL=R [n] &OxOOOOFFFF; 
RnH=(R[n]>>16) &OxOO0OFFFF; 


RmL=R [m] &0x0000FFFF; 
RmH=(R[m]>>16 ) S0x0000FFFF; 


temp0O=RmL*RnL; 
temp1=RmH*RnL; 
temp2=RmL*RndH ; 
temp3=RmH*RnH ; 


Res2=0 

Resl=templ1+temp2 ; 

if (Resl<templ) Res2+=0x00010000; 
temp1=(Res1<<16 ) &OxFFFF0000; 
Res0=temp0+temp!1 ; 


if (ResO<temp0) Res2++; 


Res2=Res2+( (Res1>>16)&Ox0000FFFF )+temp3 ; 
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MACH=Res2; 


MACL=Res0; _ 


PC+=2; 
ar 
Example: 


DMULU RO,R1 


STS MACH, RO 
STS MACL, RO 


Before execution RO = H'FFFFFFFE, R1 = H'00005555 


After execution. MACH = H'00005554, MACL =H 


Operation result (top) 
Operation result (bottom) 
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'FFFF5556 


6.22 DT (Decrement and Test): Arithmetic Instruction (SH-2 CPU) 


Format Abstract Code State T Bit 
DT Rn Rn - 1 — Rn; 0100nnnn00010000_~—=Ss ‘1 Comparison 
When Rn is 0, 1 > T, result 


when Rn is nonzero, 0 > T 


Description: The contents of general register Rn is decremented by 1 and the result is compared 
to 0 (zero). When the result is 0, the T bit is set to 1. When the result is not zero, the T bit is set to 
0. 


Operation: 


DT(long n) /* DT Rn */ 
{ 


R[n)--; 
if (R{n]==0) T=1; 
else T=0; 
PC+=2; 
} 
Example: 
MOV #4,R5 Sets the number of loops. 
LOOP : 
ADD RO,R1 
DT RS Decrements the R5 value and checks whether it has become 0. 
BF LOOP Branches to LOOP if T=0. (In this example, loops 4 times.) 
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6.23 EXTS (Extend as Signed): Arithmetic Instruction | 


Format 
EXTS.B  Rn,Rn 


EXTS.W Rn,Rn © 


Abstract | Code So State T Bit 
Sign-extended Rm from byte > 0110nnnnmmmm1110 1 — 
Rn | 

_ Sign-extended Rm from word + 0110nnnnmmmm1111- 1 — 
Rn 





Description: Sign-extends general register Rm data, and stores the result in Rn. If byte length is 
specified, the bit 7 value of Rm is transferred to bits 8 to 31 of Rn. If word length is specified, the 
bit 15 value of Rm is transferred to bits 16 to 31 of Rn. 


Operation: 


EXTSB(long m,long n) 


{ 


R[{n]=R[m]; 


/* EXTS.B Rm,Rn */ 


if ((R[m] &0x00000080)==0) R[n]&=OxO0O0000FF; 
else R[n] |=OxFFFFFFOO; 


PC+=2; 
} 


EXTSW(long m,long n) 


{ 


R[n]=R[m]; 


/* EXTS.W Rm,Rn */ © 


if ((R[m]&0x00008000)==0) R[n]&=Ox0000FFFF; 
else R[n] |=OxFFFFOO0OO; 


PC+=2; 
} 


Examples: 


EXTS.B RO,R1 


EXIS.W RO,R1 


Before execution 
After execution 
Before execution 


After execution 


RO = H'00000080 
R1 = H'FFFFFF80 
RO = H'00008000 
R1 = H'FFFF8000 
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6.24 EXTU (Extend as Unsigned): Arithmetic Instruction 


Format Abstract Code State T Bit 
EXTU.B Rm,Rn Zero-extend Rm from byte > Rn  0110nnnnmmmm1100 1 — 
EXTU.W Rm,Rn Zero-extend Rm from word > Rn = 011 0nnnnmmmm1101 1 — 


Description: Zero-extends general register Rm data, and stores the result in Rn. If byte length is 
specified, 0 is transferred to bits 8 to 31 of Rn. If word length is specified, 0 is transferred to bits 
16 to 31 of Rn. 


Operation: 


EXTUB(long m,long n) /* EXTU.B Rm,Rn */ 
{ 

R{n]=R[m]; 

R[n] &=Ox000000FF; 

PC+=2; 
} 


EXTUW(long m,long n) /* EXTU.W Rm,Rn */ 
{ 


R[n]=R[m] ; 
R[n] &=Ox0000FFFF; 
PC+=2; 
} 
Examples: 


EXTU.B RO,R1 Before execution RO =H'FFFFFF80 
After execution R1 = H'00000080 

EXTU.W RO,R1 Before execution RO=H'FFFF8000 
After execution R1 = H'00008000 
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6.25 JMP (Jump): Branch Instruction 
Class: Delayed branch instruction 


Format Abstract | | Code State =‘ T Bit 





IMP @Rm Rm — PC | 0100mmmm00101011 2 a 


Description: Delayed-branches unconditionally to the address specified with register indirect. The 
branch destination is an address specified by the 32-bit data in general register Rm. | 


Note: Since this is a delayed branch instruction, the instruction after JMP is executed before 
branching. No interrupts or address errors are accepted between this instruction and the next 
instruction. If the next instruction is a branch instruction, it is acknowledged as an illegal slot 
instruction. 


Operation: 


JMP(long m) /* JMP @Rm */ 
{ 


unsigned long temp; 


temp=PC; 
PC=R[m]+4; 
Delay_Slot(temptz2) ; 


} 
Example: 

MOV.L JMP_TABLE,RO Address of RO = TRGET 
JMP @RO - Branches to TRGET 
MOV RO,R1 Executes MOV before branching 
.align 4 | 

JMP_TABLE: .data.1  TRGET Jump table 

TRGET: ADD #1,R1 < Branch destination 


Note: With delayed branching, branching occurs after execution of the slot instruction. 
However, instructions such as register changes etc. are executed in the order of delayed 
branch instruction, then delay slot instruction. For example, even if the register in which the 
branch destination address has been loaded is changed by the delay slot instruction, the 
branch will still be made using the value of the register prior to the change as the branch 
destination address. 
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6.26 JSR (Jump to Subroutine): Branch Instruction 


Class: Delayed branch instruction 


Format Abstract Code State T Bit 
JSR @Rm PC + PR, Rm > PC 0100mmmm00001011 2 — 


Description: Delayed-branches to the subroutine procedure at a specified address after executing 
the instruction following this JSR instruction. The PC value is stored in the PR. The jump 
destination is an address specified by the 32-bit data in general register Rm. The PC points to the 
starting address of the second instruction after JSR. The JSR instruction and RTS instruction are 
used for subroutine procedure calls. 


Note: Since this is a delayed branch instruction, the instruction after JSR is executed before 
branching. No interrupts and address errors are accepted between this instruction and the next 
instruction. If the next instruction is a branch instruction, it is acknowledged as an illegal slot 
instruction. 


Operation: 


JSR(long m) /* JSR @Rm */ 
{ 
PR=PC; 
PC=R[m]+4; 
Delay_Slot(PR+2); 
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Example: 


MOV.L JSR_TABLE, RO RO = Address of TRGET 


JSR @RO Branches to TRGET 
XOR R1,R1 Executes XOR before branching - 
ADD RO,R1 < Return address for when the 
subroutine procedure is completed | 
(PR data) 
align 4 
JSR_TABLE: .data.1 TRGET Jump table 
TRGET : NOP < Procedure entrance 
MOV -R2,R3 
RTS Returns to the above ADD instruction 
MOV #70,R1 Executes MOV before RTS 


Note: With delayed branching, branching occurs after execution of the slot instruction. 
However, instructions such as register changes etc. are executed in the order of delayed 
branch instruction, then delay slot instruction. For example, even if the register in which the 
branch destination address has been loaded is changed by the delay slot instruction, the _ 
branch will still be made using the value of the register prior to the change as the branch 
destination address. 
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6.27 LDC (Load to Control Register): System Control Instruction 


Class: Interrupt disabled instruction 


Format Abstract Code State T Bit 
LDC Rm,SR Rm — SR 0100mmmm00001110 1 LSB 
LDC Rm,GBR Rm — GBR 0100mmmm00011110 1 — 
LDC Rm, VBR Rm — VBR 0100mmmm00101110 1 — 
LDC.L @Rm+,SR (Rm) — SR, Rm + 4 > Rm 0100mmmm00000111 3 LSB 
LDC.L @Rm+,GBR (Rm) > GBR, Rm+4-— Rm 0100mmmm00010111 3 — 
LDC.L @Rm+,VBR (Rm) —> VBR, Rm+4-— Rm 0100mmmm00100111 3 — 





Description: Stores the source operand into control registers SR, GBR, or VBR. 


Note: No interrupts are accepted between this instruction and the next instruction. Address errors 
are accepted. 


Operation: 


LDCSR(long m) /* LDC Rm,SR */ 
{ 

SR=R[m] &0x000003F3; 

PC+=2; 


} 


LDCGBR(long m) /* LDC Rm,GBR */ 
{ 

GBR=R[m] ; 

PC+=2; 
} 
LDCVBR(long m) /* LDC Rm,VBR */ 
{ 

VBR=R[m] ; 

PC+=2; 
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LDCMSR(long m) /* LDC.L @Rm+,SR */ 

; | | a eee 
SR=Read_Long(R[m] )&0x000003F3; 
R[Km]+=4; 
PC+=2; 

} 


LDCMGBR(long m) /* LDC.L @Rm+,GBR */ 
, 
GBR=Read_Long(R[m]); 
Rm] +=4;_ 
PC+=2; 
} 


LDCMVBR(long m) /* LDC.L @Rm+,VBR */ | 
{ 

VBR=Read_Long(R[m] ); 

R[m]+=4; 

PC+=2; 
} 


Examples: 


LDC RO,SR Before execution §RO=H'FFFFFFFF, SR = H'00000000 — 
After execution SR = H'000003F3 


LDC.L @R15+,GBR Before execution R15 = H'10000000 
After execution R15 = H'10000004, GBR = @H'10000000 
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6.28 LDS (Load to System Register): System Control Instruction 


Class: Interrupt disabled instruction 


Format Abstract Code State T Bit 
LDS Rm, MACH Rm — MACH 0100mmmm00001010 1 — 
LDS Rm, MACL Rm — MACL 0100mmmm00011010 1 — 
LDS Rm, PR Rm — PR 0100mmmm00101010 1 — 
LDS.L @Rm+,MACH (Rm)—MACH,Rm+4-— Rm = 0100mmmm00000110 1 — 
LDS.L @kRm+,MACL (Rm) — MACL, Rm +4-— Rm 0100mmmm00010110 1 — 
LDS.L @Rm+,PR (Rm) > PR, Rm + 4 > Rm 0100mmmm00100110 1 — 


Description: Stores the source operand into the system registers MACH, MACL, or PR. 


Note: No interrupts are accepted between this instruction and the next instruction. Address errors 
are accepted. 


For the SH-1 CPU, the lower 10 bits are stored in MACH. For the SH-2 CPU, 32 bits are stored in 
MACH. 


Operation: 
LDSMACH (long m) /* LDS Rm,MACH */ 
{ 
MACH=R[m] ; 
Lf ((MACH&0x00000200)==0) MACH6=0x000003FF; | For SH-1 CPU(these 2 lines 
else MACH|=OxFFFFFCOO; not needed for SH-2 CPU) 
a oe eee scsstieiab Sacenen tect tan cenisnnas: ; 
| 
LDSMACL(long m) /* LDS Rm,MACL */ 
{ 
MACL=R[m] ; 
PC+=2; 
} 
LDSPR(long m) /* LDS Rm,PR */ 
{ 
PR=R[m] ; 
PC+=2; 
} 


83 


LDSMMACH (long m) /* LDS.L @Rm+,MACH */ 


MACH=Read_Long(R[m] ); 


if ((MACH&0x00000200)==0) MACH&=0x000003FF; E For SH-1 CPU (these 2 lines | 


else MACH|=0xFFFFFCO0; not needed for SH-2 CPU) 
a. Sa 

PCt=2; 

} 

LDSMMACL (long m) /* LDS.L @Rmt+,MACL */ 

{ 
MACL=Read_Long(R[m]); 
R[m]+=4; 
PCt+=2; 

} 

LDSMPR(long m) /* LDS.L @Rm+,PR */ 

{ 
PR=Read_Long(R[m] ); 
R({m]+=4; 
PC+=2; 

} 

Examples: 
LDS RO,PR Before execution RO =H'12345678, PR = H'00000000 
After execution PR = H'12345678 
LDS.L @R15+,MACL Before execution R15 = H'10000000 


After execution R15 = H'10000004, MACL = @H'10000000 


6.29 MAC.L (Multiply and Accumulate Long): Arithmetic Instruction 


(SH-2 CPU) 
Format Abstract Code State T Bit 
MAC.L @Rm+,@Rn+ Signed operation, (Rn) x (Rm) + 0000nnnnmmmm1111 3/(2to — 
MAC + MAC 4) 


Description: Signed-multiplicates 32-bit operands obtained using the contents of general registers 
Rm and Rn as addresses. The 64-bit result is added to contents of the MAC register, and the final 
result is stored in the MAC register. Every time an operand is read, they increment Rm and Rn by 
four. 


When the S bit is cleared to 0, the 64-bit result is stored in the coupled MACH and MACL 
registers. When bit S is set to 1, addition to the MAC register is a saturation operation at the 48th 
bit starting from the LSB. For the saturation operation, only the lower 48 bits of the MACL 
registers are enabled and the result is limited to a range of H'FFFF800000000000 (minimum) to 
H'00007FFFFFFFFFFF (maximum). 


Operation: 


MACL(long m,long n) /* MAC.L @Rm+, @Rn+*/ 

{ 
unsigned long RnL,RnH,RmL, RmH, Res0,Res1,Res2; 
unsigned long temp0,templ,temp2,temp3; 
long tempm,tempn, fnLmL; 


tempn=(long)Read_Long(R[n]); 
R[n]+=4; 
tempm=(long)Read_Long(R[m] ); 
R[{m]+=4; 


if ((long) (tempn*tempm)<0) fnLmL=-1; 
else fnLmL=0; 

if (tempn<0) tempn=0-tempn; 

if (tempm<0) tempm=0-tempm; 


temp1=(unsigned long)tempn; 


temp2=(unsigned long)tempm; 
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RnL=temp1&0x0000FFFF; 

- RnH=(temp1>>16 ) S0xOO000FFFF; 
RmL=temp2&0x0000FFFF; 

_ RmH=(temp2>>16 ) &0x0000FFFF; 


tempO=RmL*RnL; 
temp1=RmH*RnL; 
temp2=RmL*RnH; 
temp3=RmH*RnH ; 


Res2=0; 
Resl=templ+temp2; 
if (Resl<templ) Res2+=0x00010000; 


temp1=(Res1<<16) &OxFFFFO0000; 
ResO=temp0+temp! ; | 
if (ResO<temp0) Res2++; 


Res2=Res2+( (Res1>>16 ) &OxOO00FFFF )+temp3; 


if (fnLm<0) { 
Res 2=~Res2; 
if (Res0==0) Res2++; 
else Res0=(~Res0)+1; 

} 

if (S==1) { 
ResO=MACL+Res0; 
if (MACL>Res0O) Res2++; 
Res2+= (MACH&OXOOOOFFFF) ; 


if (( (long) Res2<0 ) && (Res2<0xFFFF8000) ) { 
Res2=0x00008000; 
Res0=0x00000000; 

} 

if(( (long) Res2>0) && (Res2>0x00007FFF) ) { 
Res2=0x00007FFF; 
ResO=OXFFFFFFFF ; 

}; 
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MACH=Res2:; 
-MACL=Res0; 
} 
else {. 
Res0=MACL+Res0; 
if (MACL>Res0) Res2++; 


Res 2+=MACH 
MACH=Res2 ;. 
MACL=Res0O; 

} 

PC+=2; 

} 
Example: 
MOVA TBLM, RO - Table address 
MOV RO,R1 | 
MOVA TBLN, RO Table address 
CLRMAC MAC register initialization 
MAC .L @RO+, GR1+ 
MAC.L @RO+, GR1+ 
STS MACL, RO Store result into RO 
align 2 


TBLM .data.1 H'1234ABCD 
.data.] H'5678EF01 
TBLN .data.l H'0123ABCD 
.data.1 H'4567DEFO 
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6.30 MAC (Multiply and Accumulate): Arithmetic Instruction (SH-1 CPU) 


Format Abstract Code State T Bit 
MAC.W @Rm+,@Rn+ With signed, (Rn) x(Rm)+MAC 0100nnnnmmmm1111 3/2) — 
— MAC 


Description: Multiplies 16-bit operands obtained using the contents of general registers Rm and 
Rn as addresses. The 32-bit result is added to contents of the MAC register, and the final result is 
stored in the MAC register. Everytime an operand is read, they increment Rm and Rn by two. 


When the S bit is cleared to 0, the 42-bit result is stored in the coupled MACH and MACL 
registers. Bit 9 data is transferred to the upper 22 bits (bits 31 to 10) of the MACH register. 


When the S bit is set to 1, addition to the MAC register is a saturation operation. For the saturation 
operation, only the MACL register is enabled and the result is limited to a range of H'80000000 
(minimum) to H"7FFFFFFF (maximum). 


If an overflow occurs, the LSB of the MACH register is set to 1. The result is stored in the MACL 
register, and the result is limited to a value between H'80000000 (minimum) for overflows in the 
negative direction and H’7FFFFFFF (maximum) for overflows in the positive direction. 


Note: The normal number of cycles for execution is 3; however, this instruction can be executed 
in two cycles according to the succeeding instruction. 
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6.31 MAC.W (Multiply and Accumulate Word): Arithmetic Instruction 


Format Abstract Code State T Bit 


MAC.W @Rm+,@Rn+ Signed operation, 0100nnnnmmmm1111 3/(2) — 
MAC @Rm+,@Rn+ (Rn) x(Rm) + MAC — MAC 


Description: Signed-multiplicates 16-bit operands obtained using the contents of general registers 
Rm and Rn as addresses. The 32-bit result is added to contents of the MAC register, and the final 
result is stored in the MAC register. Everytime an operand is read, they increment Rm and Rn by 
two. 


When the S bit is cleared to O, the operation is 16 x 16 + 64 — 64-bit multiply and accumulate and 
the 64-bit result is stored in the coupled MACH and MACL registers. 


When the S bit is set to 1, the operation is 16 x 16 + 32 — 32-bit multiply and accumulate and 
addition to the MAC register is a saturation operation. For the saturation operation, only the 
MACL register is enabled and the result is limited to a range of H'80000000 (minimum) to 
H'"7FFFFFFF (maximum). 


If an overflow occurs, the LSB of the MACH register is set to 1. The result is stored in the MACL 
register, and the result is limited to a value between H'80000000 (minimum) for overflows in the 
negative direction and H"7FFFFFFF (maximum) for overflows in the positive direction. 


Note: When the S bit is 0, the SH-2 CPU performs a 16 x 16 + 64 — 64 bit multiply and 
accumulate operation and the SH-1 CPU performs a 16 x 16 + 42 — 42 bit multiply and 
accumulate operation. 


Operation: 


MACW(long m,long n) /* MAC.W @Rm+, @Rn+*/ 
{ 
long tempm, tempn,dest,src,ans; 
unsigned long templ; 
tempn=(long)Read_Word(R[n]); 
R[n]+=2; 
tempm=(long)Read_Word(R{[m] ); 
R[m]+=2; 
temp1=MACL; 
tempm=( (long) (short) tempn* (long) (short) tempm) ; 
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if ((long)MACL>=0) dest=0; 
else dest=1; _ | 


if ((long)tempm>=0 { 


src=0; 
-tempn=0; 
} 
else { 
sro=1; 
tempn=0OxFFFFFFFF ; 
} 
srct=dest; 
MACLt+=tempm ; 


if ((long)MACL>=0) ans=0; 
else ans=1; 

ans+=dest; 

if (S==1) { 


if (ans==1) { 


PCCHHCESSHOHHHASCEAHTAHAECE HEH SEHCHKCHOSAHRAOEHHCOESSHHHAHCHHORHTUTHHHEHESHEHESCHHAHSHHASRRHSEHEHSEKS 
s 


if (srce==0 || src==2) 
MACH |=0x00000001; . 


if (src==0) MACL=0x7FFFFFFF; 
if (src==2) MACL=0x80000000; 


} 
else { 
MACH+=tempn ; 
if (templ>MACL) MACH+=1; 


if ((MACH&0x00000200)==0) 
MACH&=0x000003FF; 
else MACH|=0OxFFFFFCOO; 
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For SH-1 CPU (these 2 lines 
not needed for SH-2 CPU) 


For SH-1 CPU (these 3 lines 
not needed for SH-2 CPU) 


Example: 


TBLM 


TBLN 


.data.w 


.data.w 


TBLM, RO 
RO,R1 
TBLN, RO 


@RO+, GR1+ 
@RO+, @R1+ 
MACL, RO 


H'1234 
H'5678 
H'0123 
H'4567 


Table address 
Table address 


MAC register initialization 


Store result into RO 
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6.32 MOV (Move Data): Data Transfer Instruction 





Format Abstract 3 -— Code — State T Bit 
MOV Rm,Rn Rm — Rn 0110nnnnmmmm0011 = — 
MOV.B Rm, @Rn Rm = (Rn) } 0010nnnnmmmm0000 1 - 
MOV.W Rm, @Rn Rm —> (Rn) | 0010nnnnmmmm0001 _~—s 1 — 
MOV.L Rm, @Rn Rm — (Rn) 0010nnnnmmmm0010 = 1 — 
MOV.B @Rm,Rn (Rm) — sign extension — Rn 0110nnnnmmmm0000 ~~ 1 — 
MOV.W @Rm,Rn (Rm) — sign extension — Rn 0110nnnnmmmm0001 = 1 — 
MOV.L @Rm,Rn (Rm) > Rn 0110nnnnmmmm0010 = 1 — 
MOV.B Rm, @—Rn Rn —1— Rn, Rm > (Rn) 0010nnnnmmmm0100 ~—=—s 1 — 
MOV.W Rm, @—Rn Rn —2 — Rn, Rm — (Rn) 0010nnnnmmmm0101 = 1 — 
MOV.L Rm, @—Rn Rn — 4 — Rn, Rm —> (Rn) 0010nnnnmmmm0110 = 1 — 
MOV.B @Rm+,Rn (Rm) — sign extension > Rn, Rm 0110nnnnmmmm0100 1 — 
+1—Rm 
MOV.W @Rm+,Rn (Rm) — sign extension > Rn, Rm 0110nnnnmmmm0101 _~— 1 — 
+2— Rm 
MOV.L @Rm+,Rn (Rm) > Rn, Rm + 4— Rm 0110nnnnmmmm0110 = =1 — 
MOV.B Rm, @(RO,Rn) Rm — (RO + Rn) 000Onnnnmmmm0100~=—=s 1 — 
MOV.W Rm, @(RO, Rn) Rm — (RO + Rn) 0000Onnnnmmmm0101 ~—s 1 — 
MOV.L Rm, @(RO,Rn) Rm — (RO + Rn) 000Onnnnmmmm0110 = 1 — 
MOV.B @(RO,Rm),Rn (RO+Rm) > sign extension > 0000nnnnmmmm1100 = 1 — 
Rn 0000nnnnmmmm1101 = 1 — 
MOV.W @(RO,Rm),Rn Ly + Rm) — sign extension oo00nnnnmmmm1110 1 = 


MOV.L @(RO,Rm),Rn (RO+Rm)—Rn 


Description: Transfers the source operand to the destination. When the operand.is stored in 
memory, the transferred data can be a byte, word, or longword. When the source operand is in 
memory, loaded data from memory is stored in a register after it is sign-extended to a longword. 


Operation: 


MOV(long m,long n) /* MOV Rm,Rn */ 
{ 

R[n]=R[{m]; 

PC+=2; 
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MOVBS (long m,long n) /* MOV.B Rm, @Rn */ 
{ 
Write_Byte(R[n],R{m]); 


PCt+=2; 
} 
MOVWS (long m,long n) /* MOV.W Rm, @Rn */ 
| 
Write_Word(R[n],R[m]); 
PC+=2; 
} 
MOVLS (long m, long n) /* MOV.L Rm, @Rn */ 
{ 
Write_Long(R[n],R[m] ); 
PC+=2; 
} 
MOVBL(long m,long n) /* MOV.B @Rm,Rn */ 
{ 
R[n]=(long)Read_Byte(R[m] ); 
if ((R{n]&0x80)==0) R[n]&0x000000FF; 
else R[n] |=OxFFFFFFOO; 
PC+=2; 
} 
MOVWL( long m,long n) /* MOV.W @Rm,Rn */ 
{ 
R[{n]=(long)Read_Word(R[m] ); 
if ((R[{n]&0x8000)==0) R[n] &0xOO00FFFF; 
else R[n] |=OxFFFFO000; 
PCt=2; 
} 
MOVLL(long m,long n) /* MOV.L @Rm,Rn */ 
{ 
R[n]=Read_Long(R[m]); 
PC+=2; 
} 
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MOVBM(long m,long n) /* MOV.B Rm, @—Rn */ 
{ 
Write_Byte(R[n]-1,R[m]); 


R[n]-—=1; 
PC+=2; 
} 
MOVWM(long m,long n) /* MOV.W Rm, @—Rn */ 
| 
Write_Word(R[n]-2,R[m]); 
R[n]-=2; 
PC+=2; 
} 
MOVLM(long m,long n) /* MOV.L Rm, @—Rn */ 
{ 
Write_Long(R[{n]-—4,R[m]); 
R[n]-=4; 
PC+=2; 
} 


MOVBP(long m,long n) /* MOV.B @Rmt+t,Rn */ 
{ 
R[n]=(long)Read_Byte(R[m] ); 
if ((R[n]&0x80)==0) R[n]&0x000000FF; 
else R[n] |=OxFFFFFFOO; 
if (n!=m) R[m]+=1; 
PC+=2; 
} 


MOVWP( long m,long n) /* MOV.W @Rm+,Rn */ 
{ 
R[n]=(long)Read_Word(R[m]); 
if ((R[n]&0x8000)==0) R[n] &0xO000FFFF; 
else R[n] |=OxFFFFOOOO; 
if (n!=m) R[m]+=2; 
PC+=2; 
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MOVLP(long m,long n) /* MOV.L @Rm+,Rn */ 
{ 

R[n]=Read_Long(R[m] ); 

if (n!=m) R[m]+=4; 


PCt+=2; 
} 
MOVBS0(long m,long n) /* MOV.B Rm,@(RO,Rn) */ 
| 
Write_Byte(R[n]+R[0],R[m] ); 
PCt+=2; 
} 
MOVWSO0 (long m,long n) /* MOV.W Rm,@(RO,Rn) */ 
{ 
Write_Word(R[n]+R[0],R[m] ); 
PC+=2; | 
} 


MOVLSO(long m,long n) /* MOV.L Rm,@(RO,Rn) */ 
{ 

Write_Long(R[n]+R[0],R[m] ); 

PC+=2; 
} 


MOVBLO(long m,long n) /* MOV.B @(RO,Rm),Rn */ 
{ 

R[n]=(long)Read_Byte(R[m]+R[0]); 

if ((R[n]&0x80)==0) R[n]&OxO0O0000FF; 

else R[n] |=OxFFFFFFOO; 

PCt=2; 
} 


MOVWLO(long m,long n) /* MOV.W @(RO,Rm),Rn */ 
| 
R[n]=(long)Read_Word(R[m]+R[0]); 
if ((R[n]&0x8000)==0) R[n] &0x0000FFFF; 
else R[n] |=OxFFFFOOOO; 
PC+=2; 
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MOVLLO(long m,long n) /* MOV.L @(RO,Rm),Rn */ 


{ 
R[n]=Read_Long(R[mJ+R[0]); 


PCt+=2;— 
} 
Example: 

MOV RO,R1 Before execution 
After execution 

MOV.W RO, @R1 Before execution 
After execution 

MOV.B @RO,R1 Before execution 
After execution 

MOV.W RO, @—-R1 Before execution 
After execution 

MOV.L @RO+,R1 Before execution 
After execution 

MOV.B R1,@(RO,R2) Before execution 
After execution 

MOV.W @(RO,R2),R1 Before execution 


After execution 


RO = H'FFFFFFFF, R1 = H'00000000 
R1 = H'FFFFFFFF 


RO = H'FFFF7F80 
@R1 = H'7F80 


@RO = H'80, R1 = H'00000000 
R1 = H'FFFFFF80 


RO = HAAAAAAAA, R1 = H'FFFF7F80 
Rl = H'FFFF7E7E, @R1 = H'AAAA 


RO = H'12345670 
RO = H'12345674, R1 = @H'12345670 


R2 = H'00000004, RO = H'10000000 
R1 = @H'10000004 


R2 = H'00000004, RO = H'10000000 
R1 = @H'10000004 
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6.33 MOV (Move Immediate Data): Data Transfer Instruction 


Format Abstract Code State TBit 

MOV #imm, Rn imm — sign extension > Rn 1110nnnniiiiiiii 1 — 

MOV.W @(disp,PC),Rn (disp x 2 + PC) > sign 1001innnndddddddd_ 1 — 
extension — Rn 

MOV.L G@(disp,PC),Rn (disp x 4+ PC) — Rn 1101nnnndddddddd_ 1 — 


Description: Stores immediate data, which has been sign-extended to a longword, into general 
register Rn. 


If the data is a word or longword, table data stored in the address specified by PC + displacement 
is accessed. If the data is a word, the 8-bit displacement is zero-extended and doubled. 
Consequently, the relative interval from the table is up to PC + 510 bytes. The PC points to the 
starting address of the second instruction after this MOV instruction. If the data is a longword, the 
8-bit displacement is zero-extended and quadrupled. Consequently, the relative interval from the 
table is up to PC + 1020 bytes. The PC points to the starting address of the second instruction after 
this MOV instruction, but the lowest two bits of the PC are corrected to B’00. 


Note: The end address of the program area (module) or the second address after an unconditional 
branch instruction are suitable for the start address of the table. If suitable table assignment is 
impossible (for example, if there are no unconditional branch instructions within the area specified 
by PC + 510 bytes or PC + 1020 bytes), the BRA instruction must be used to jump past the table. 
When this MOV instruction is placed immediately after a delayed branch instruction, the PC 
points to an address specified by (the starting address of the branch destination) + 2. 


Operation: 


MOVI(long i,long n) /* MOV #imm,Rn */ 

{ 
if ((i&0x80)==0) R[{n]=(Ox000000FF & (long)i); 
else R[n]=(OxFFFFFFOO | (long)i); 


PC+=2; 
} 
MOVWI (long d,long n) /* MOV.W @(disp,PC),Rn */ 
{ 

long disp; 
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disp=(0x000000FF & (long)d) ; | 

R[n]= (long)Read_| Word (PC+(disp<<1)); 

if ((R[n]&0x8000)==0) R[n]&=0x0000FFFF; 
else R[n]|=OxFFFF0000; 


PC+=2; 
} 
MOVLI (long d, long n) /* MOV.L @(disp, PC) ,Rn */ 
| 
long disp; 
disp=(Ox000000FF & (long)d); 
R[n]=Read_Long( (PC&0xFFFFFFFC ) + (disp<<2) ) ; 
PCt+=2; 
Zz 
Example: 
Address 
1000 MOV #H'80,R1 R1 = H'FFFFFF80 
1002. ‘MOV.W IMM, R2 R2 = H'FFFF9ABC, IMM means @(H'08,PC) 
1004 ADD #-1, RO | : a 
1006 TST RO,RO < PC location used for address calculation for the 
| MOV.W instruction 
1008 MOVT R13 
100A BRA NEXT Delayed branch instruction 
100C MOV.L . @(4,PC),R3 R3 = H'12345678 
100E IMM .data.w  H'9ABC 
1010 .data.w H'1234 
1012 NEXT JMP @R3 Branch destination of the BRA instruction © 
1014 CMP/EQ #0,RO < PC location used for address calculation for the 
MOV.L instruction 
,align 4 
1018 .data.l H'12345678 
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6.34 MOV (Move Peripheral Data): Data Transfer Instruction 


Format Abstract Code State T Bit 


MOV.B @(disp,GBR),RO (disp + GBR) —> sign 11000100dddddddd 1 — 
extension — RO 


MOV.W @(disp,GBR) , RO (disp x2 + GBR) > 11000101dddddddd 1 — 
sign extension — RO 


MOV.L @(disp,GBR) ,RO (disp x4+ GBR) — RO 11000110dddddddd 
MOV.B RO,@(disp,GBR) RO- > (disp + GBR) 11000000dddddddd 
MOV.W RO,@(disp,GBR) RO- (disp x2+GBR) 11000001dddddddd 
MOV.L R0O,@(disp,GBR) RO—- (disp x4+GBR) 11000010dddddddd 


a a ee 


Description: Transfers the source operand to the destination. This instruction is suitable for 
accessing data in the peripheral module area. The data can be a byte, word, or longword, but the 
register is fixed to RO. 


A peripheral module base address is set to the GBR. When the peripheral module data is a byte, 
the 8-bit displacement is zero-extended. Consequently, an address within +255 bytes can be 
specified. When the peripheral module data is a word, the 8-bit displacement is zero-extended and 
doubled. Consequently, an address within +510 bytes can be specified. When the peripheral 
module data is a longword, the 8-bit displacement is zero-extended and is quadrupled. 
Consequently, an address within +1020 bytes can be specified. If the displacement is too short to 
reach the memory operand, the above @(RO,Rn) mode must be used after the GBR data is 
transferred to a general register. When the source operand is in memory, the loaded data is stored 
in the register after it is sign-extended to a longword. 


Note: The destination register of a data load is always RO. RO cannot be accessed by the next 
instruction until the load instruction is finished. Changing the instruction order shown in figure 6.1 
will give better results. 


MOV.B @(12, GBR), R MOV.B @(12, GBR), RO 


AND ~ #80, RO i ADD #20, R1 
ADD #20,R1 AND ~ #80, RO 


Figure 6.1 Using RO after MOV 





99 


Operation: 


MOVBLG(long d) /* MOV.B @(disp,GBR),RO */ 
[ ss : 
long disp; 


disp=(OxO000000FF & (long)d); 
R[0]=(1long) Read_Byte(GBRt+disp) ; 
if ((R[0]&0x80)==0) R[0]&=0x000000FF; 
else R[0] |=OxFFFFFFOO; 
PC+=2; 

} 


MOVWLG(long d) /* MOV.W @(disp,GBR),RO */ 
long disp; 


Gisp=(O0x000000FF & (long)d); 
-R[0]=(long)Read_Word(GBR+(disp<<1) ); 
if ((R[0]&0x8000)==0) R[0]&=Ox0000FFFF; 
else R[O] |=0xFFFFO000; 
PC+=2; | 

} 


MOVLLG(long d) /* MOV.L @(disp,GBR),RO */ 
{ 
| long disp; 


disp=(Ox000000FF & (long)d); 
R[0]=Read_Long(GBR+ (disp<<2) ); 
PC+=2; 

} 


MOVBSG(long d) /* MOV.B RO, @(disp,GBR) */ 


{ 
long disp; 


100 


disp=(OxO00000FF & (long)d); 
Write_Byte(GBR+disp,R[0]); 
PC+=2; 

} 


MOVWSG( long d) /* MOV.W RO, @(disp,GBR) */ 
{ 
long disp; 


disp=(Ox000000FF & (long)d) ; 
Write_Word (GBR+(disp<<1),R[0]); 
PC+=2; 

} 


MOVLSG(long d) /* MOV.L RO, @(disp,GBR) */ 


{ 
long disp; 


disp=(OxOOOO0OOFF & (long)d); 
Write_Long(GBR+(disp<<2),R[0]); 
PC+=2; 

} 


Examples: 


MOV.L @(2,GBR),RO Before execution @(GBR + 8) = H'12345670 
After execution RO = @H'12345670 


MOV.B RO, @(1,GBR) Before execution RO = H'FFFF7F80 
After execution @(GBR + 1) = H'FFFF7F80 
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6.35 MOV (Move Structure Data): Data Transfer Instruction | 


Format Abstract Code State .T Bit 
MOV.B RO, @(disp,Rn) RO — (disp + Rn) 10000000nnnndddd 1 — 
MOV.W RO,@(disp,Rn) RO->» (disp x2+ Rn) 10000001nnnndddad~—Ss 1 — 
MOV.L kRm,@(disp,Rn) Rm-—>(disp x4+Rn)  0001nnnnmmmmdddd 1 — 
MOV.B @(disp,Rm),RO (disp + Rm) —- sign 10000100mmmmdddd 1 — 
extension — RO 
MOV.W @(disp,Rm),RO (disp x 2+ Rm) > sign 10000101mmmmdddd 1 — 
extension — RO . a . 
MOV.L @(disp,Rm),Rn (disp x 4+ Rm) > Rn 0101nnnnmmmmdddd 1 — 


Description: Transfers the source operand to the destination. This instruction is suitable for 
accessing data in a structure or a stack. The data can be a byte, word, or longword, but when a byte 
or word is selected, only the RO register is fixed. When the data is a byte, the 4-bit displacement is 
zero-extend. Consequently, an address within +15 bytes can be specified. When the data is a word, 
the 4-bit displacement is zero-extended and doubled. Consequently, an address within +30 bytes 
can be specified. When the data is a longword, the 4-bit displacement is zero-extended and 
quadrupled. Consequently, an address within +60 bytes can be specified. If the displacement is too 
short to reach the memory operand, the aforementioned @(RO,Rn) mode must be used. When the 
source operand is in memory, the loaded data is stored in the register after it is sign-extended to a 
longword. 


Note: When byte or word data is loaded, the destination register is always RO. RO cannot be 
accessed by the next instruction until the load instruction is finished. Changing the instruction 
order in figure 6.2 will give better results. 


MOV.B @(2,R1); RO ~ MOV.B @(2, R1), RO 


~ AND #80, RO SS ADD #20, R1 
ADD #20, R1 AND #80, RO 


Figure 6.2 Using RO after MOV 
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Operation: 


MOVBS4 (long d,long n) /* MOV.B RO,@(disp,Rn) */ 
{ 
long disp; 


disp=(QOxO0O000000F & (long)d); 
Write_Byte(R[n]+disp,R[0]); 
PC+=2; 

} 


MOVWS4 (long d,long n) /* MOV.W RO,@(disp,Rn) */ 
{ 
long disp; 


disp=(O0x0000000F & (long)d); 
Write_Word(R[n]+(disp<<1),R[0]); 
PC+=2; 

} 


MOVLS4 (long m,long d,long n) 
/* MOV.L Rm,@(disp,Rn) */ 


long disp; 


disp=(Ox0O000000F & (long)d); 
Write _Long(R[n]+(disp<<2),R[m]); 
PC+=2; 

} 


MOVBL4 (long m,long d) /* MOV.B @(disp,Rm),RO */ 
{ 
long disp; 


disp=(0x0000000F & (long)d); 
R[0]=Read_Byte(R[m]+disp) ; 

if ((R[0]&0x80)==0) R[0]&=Ox000000FF;; 
else R[0] |=OxFFFFFFOO; 

PCt+=2; 
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MOVWL4(long m,long d) /* MOV.W @(disp,Rm),RO */ 


{ 
long disp; 


disp=(OxO0O00000F & (long)d); 
R[0]=Read_Word(R[m]+(disp<<1) ); 

if ((R[0]&0x8000)==0) R[0]&=0x0000FFFF; 
else R[0] |=OxFFFFO000; 


PC+=2; 
} 


MOVLL4 (long m,long d,long n) 
/* MOV.L @(disp,Rm),Rn */ 


long disp; 


disp=(0x0000000F & (long)d); 


R[n]=Read_Long(R[m]+(disp<<2) ); 


PC+=2; 
} 


Examples: 


MOV.L @(2,R0),R1 


MOV.L RO,@(H'F,R1) 


Before execution @(RO + 8) = H'12345670 
After execution R1 = @H'12345670 


Before execution RO = H'FFFF7F80 
After execution @(R1 + 60) = H'FFFF7F80 
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6.36 MOVA (Move Effective Address): Data Transfer Instruction 


Format Abstract Code State T Bit 
MOVA @(disp,PC),RO disp x 4+ PC — RO 11000111dddddddd 1 —. 


Description: Stores the effective address of the source operand into general register RO. The 8-bit 
displacement is zero-extended and quadrupled. Consequently, the relative interval from the 
operand is PC + 1020 bytes. The PC points to the starting address of the second instruction after 
this MOVA instruction, but the lowest two bits of the PC are corrected to B’00. 


Note: If this instruction is placed immediately after a delayed branch instruction, the PC must 
point to an address specified by (the starting address of the branch destination) + 2. 


Operation: 


MOVA(long d) /* MOVA @(disp,PC),RO */ 


{ 
long disp; 


disp=(Ox000000FF & (long)d); 
R[0O]=(PC&OXFFFFFFFC ) + (disp<<2); 


PC+=2; 
} | 
Example: 

Address Org H'1006 

1006 MOVA STR, RO Address of STR > RO 

1008 MOV .B @RO,R1 R1 =“X” <— PC location after correcting the lowest 
two bits 

100A ADD R4,R5 <- Original PC location for address calculation for 
the MOVA instruction 

.align A 

100C STR: .sdata “XYZP12” 

2002 BRA TRGET Delayed branch instruction 

2004 MOVA @(0,PC),RO Address of TRGET + 2 — RO 


2006 NOP 
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6.37 MOVT (Move T Bit): Data Transfer Instruction 
Format —s—| Abstract | Code State TBit — 
MOVE Rn T+ Rn | | 0000nnnn00101001 1 cs: 


Description: Stores the T bit value into general register Rn. When T = 1, 1 is stored in Rn, and 
when T = 0, 0 is stored in Rn. . 


Operation: 


MOVT(long n) /* MOVT Rn */ 
{ 
R[n]=(0x00000001 & SR); 


PC+=2; 
} 
Example: 

XOR R2,R2 R2=0 

CMP/PZ R2 T=1 

MOVT RO RO=1 

CLRT T=0 
=0 


MOVT R1 R1 
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6.38 MUL.L (Multiply Long): Arithmetic Instruction (SH-2 CPU) 


Format Abstract : Code | State T Bit 
MUL.L Rm,Rn Rn x Rm > MACL 0000nnnnmmmm0111 2to4 — 


Description: Performs 32-bit multiplication of the contents of general registers Rn and Rm, and 
stores the lower 32 bits of the result in the MACL register. The MACH register data does not 
change. | 


Operation: 


MULL(long m,long n) /* MUL.L Rm,Rn */ 
{ 

MACL=R[n]*R[m] ; 

PC+=2; 
} 


Example: 


MUL.L RO,R1 Before execution RO=H'FFFFFFFE, R1 = H'00005555 
After execution MACL = H'FFFF5556 
STS MACL, RO Operation result 
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6.39 MULS.W (Multiply as Signed Word): Arithmetic Instruction 


Format > = Abstract . 3 Code er ee State T Bit. 
MULS.W  Rm,Rn Signed operation, RnxRm-—> 0010nnnnmmmm1111 1to3 — 
MULS Rm, Rn MACL | | | 


Description: Performs 16-bit multiplication of the contents of general registers Rn and Rm, and 
stores the 32-bit result in the MACL register. The operation is signed and the MACH register data 
does not change. | | 


Operation: 


MULS(long m,long n) /* MULS Rm,Rn */ 

{ 
MACL=( (long) (short)R[n]*(long) (short )R[m] ); 
PC+=2; 

} 


Example: 


MULS RO,R1  Beforeexecution RO =H'FFFFFFFE, R1 = H'00005555 
After execution MACL = H'FFFF5556 
STS MACL,RO Operation result 
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6.40 MULU.W (Multiply as Unsigned Word): Arithmetic Instruction 


Format Abstract | Code State T Bit 


MULU.W  Rm,Rn Unsigned, Rn x Rm — MAC 0010nnnnmmmm1110 1to3 — 
MULU Rm, Rn 


Description: Performs 16-bit multiplication of the contents of general registers Rn and Rm, and 
stores the 32-bit result in the MACL register. The operation is unsigned and the MACH register 
data does not change. 


Operation: 


MULU(long m,long n) /* MULU Rm,Rn */ 
{ 
MACL=( (unsigned long) (unsigned short)R[n] 
*(unsigned long) (unsigned short)R[m]); 
PCt+=2; 
} 


Example: 


MULU  RO,R1 Before execution RO = H'00000002, R1 = H'FFFFAAAA 
After execution MACL = H'00015554 
STS MACL,RO Operation result 
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6.41 NEG (Negate): Arithmetic Instruction  _ 
Format OG Abstract , Code sd State TBit 
NEG  Rm,Rn O—Rm—>Rn -. 011 0nnnnmmmm1011 1 = 


Description: Takes the two’s complement of data in general register Rm, and stores the an in 
Rn. This effectively subtracts Rm data from 0, and stores the result in Rn. » 


Operation: 


NEG(long m,long n) /7* NEG Rm,Rn */ 
{ 


R[n]=0-R[m] ; 
PC+=2; 
} 
Example: 
NEG RO,R1 Before execution RO = H'00000001 


After execution R1 = H'FFFFFFFF 
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6.42 NEGC (Negate with Carry): Arithmetic Instruction 


Format Abstract Code 7 State T Bit 
NEGC Rm, Rn 0-—Rm-T-— Rn, Borrow > T 0110nnnnmmmm1010 1 Borrow 


Description: Subtracts general register Rm data and the T bit from 0, and stores the result in Rn. 
If a borrow is generated, T bit changes accordingly. This instruction is used for inverting the sign 
of a value that has more than 32 bits. 


Operation: 


NEGC(long m,long n) =/* NEGC Rm,Rn */ 
{ 


unsigned long temp; 


temp=0-R[m] ; 
R[{n]=temp-tT; 
if (0<temp) T=1; 
else T=0; 
if (temp<R[n]) T=1; 
PC+=2; 

} 


Examples: 


CLRT Sign inversion of R1 and RO (64 bits) 

NEGC  R1,R1 Before execution R1 =H'00000001, T=0 
After execution R1 = H'FFFFFFFF, T = 1 

NEGC  RO,RO Before execution RO=H'00000000, T= 1 
After execution RO = H'FFFFFFFF, T = 1 
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6.43 NOP (No Operation): System Control Instruction 


Format , Abstract Code 
NOP No operation 0000000000001001 


Description: Increments the PC to execute the next instruction. 
Operation: 


NOP() /* NOP */ 


{ 
PC+=2; 


} 
Example: 


NOP Executes in one cycle 
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State T Bit 


1 


6.44 NOT (NOT—Logical Complement): Logic Operation Instruction 


Format Abstract Code State T Bit 
NOT Rm, Rn ~Rm — Rn 0110nnnnmmmm0111 1 — 


Description: Takes the one’s complement of general register Rm data, and stores the result in Rn. 
This effectively inverts each bit of Rm data and stores the result in Rn. 


Operation: 


NOT(long m,long n) /* NOT Rm,Rn */ 
{ 


R[n]=~R[m] ; 
PCt+=2; 
} 
Example: 


NOT RO,R1 Before execution RO=H'AAAAAAAA 
After execution R1 =H'55555555 
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6.45 OR (OR Logical) Logic Operation Instruction 


Format ws Abstract | Code _ State TBit — 
OR  Rm,Rn Rn | Rm— Rn 0010nnnnmmmm1011 1 — 
OR  #imm,RO RO! imm + RO | 11001011iiiiiiii 1 = 
— OR.B #imm,@(RO,GBR) (RO+GBR)limm—(RO+ 11001111iiiii315 3 = — 
| | GBR) | . we, 





Description: Logically ORs the contents of general registers Rn and Rm, and stores the result in 
Rn. The contents of general register RO can also be ORed with zero-extended 8-bit immediate 


data, or 8-bit memory data accessed by using indirect indexed GBR addressing can be ORed with 
8-bit immediate data. 


Operation: 


OR(long m,long n) /* OR Rm,Rn */ 
{ 

R[n]]=R[m]; 

PCt+=2; 
} 


ORI(long i) /7* OR #imm,RO */. 
{ 
RO] |=(OxO00000FF & (1long)i); 
PC+=2; 
} 


ORM(long i) /* OR.B #imm, @(RO,GBR) */ 
{ 
long temp; 


temp=(long)Read_Byte(GBR+R[0] ); 
temp|=(OxOO0000FF & (long)i); 
Write_Byte(GBR+tR[0],temp) ; 
PCt=2; 
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Examples: 


OR RO,R1 Before execution RO = H'AAAASSSS, R1 = H'55550000 
After execution R1 = H'FFFF5555 | 

OR #H'FO,RO Before execution RO = H'00000008 
After execution RO = H'OO0000F8 


OR.B  #H'50,@(RO,GBR) Before execution @(RO,GBR) = H'AS 
After execution @(RO,GBR) = H'F5S 
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6.46 ROTCL (Rotate with Carry Left): Shift Instruction 


Format —. Abstract Code State T Bit 
ROTCL Rn TeRneT 0100nnnn00100100 1 MSB 


Description: Rotates the contents of general register Rn and the T bit to the left by one bit, and 
stores the result in Rn. The bit that is shifted out of the operand is transferred to the T bit 
(figure 6.3). 





Figure 6.3 Rotate with Carry Left 


Operation: 


ROTCL(long n) /* ROTCL Rn */ 


{ 
long temp; 


if ((R{n]&0x80000000)==0) temp=0; 
else temp=1; 

R[n]<<=1; 

if (T==1) R[n] |=0x00000001; 

else R[n]&=OxFFFFFFFE; 

if (temp==1) T=1; 


else T=0; 
PC+=2; 
} 
Example: 


ROTCL RO Before execution RO = H'80000000, T = 0 
After execution RO = H'00000000, T = 1 
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6.47 ROTCR (Rotate with Carry Right): Shift Instruction 


Format Abstract Code State T Bit 
ROTCR Rn To RnoT 0100nnnn00100101 1 LSB 


Description: Rotates the contents of general register Rn and the T bit to the right by one bit, and 
stores the result in Rn. The bit that is shifted out of the operand is transferred to the T bit 
(figure 6.4). 





Figure 6.4 Rotate with Carry Right 


Operation: 


ROTCR(long n) /* ROTCR Rn */ 


{ 
long temp; 


if ((R[{n]&0x00000001)==0) temp=0; 
else temp=1; 

R[n)>>=1; 

if (T==1) R[n]]=0x80000000; 

else R[n] &=OX7FFFFFFF; 

if (temp==1) T=1; 


else T=0; 
PCt+=2; 
} 
Examples: 


ROTCR RO Before execution RO = H'00000001, T = 1 
After execution RO = H'80000000, T = 1 
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6.48 ROTL (Rotate Left): Shift Instruction 
Format ot he Abstract a Code : State T Bit 
ROTL Ro T —Rn — MSB 0100nnnn00000100 { MSB 


Description: Rotates the contents of general register Rn to the left by one bit, and stores the result 
in Rn (figure 6.5). The bit that is shifted out of the operand is transferred to the T bit. 


MSB LSB 





ROTL _ ee. 


Figure 6.5 Rotate Left 
Operation: 


ROTL(long n) /* ROTL Rn */ 

{ 
if ((R[n]&0x80000000)==0) T=0; 
else T=1; 
R[n]<<=1; | 
if (T==1) R[n]|=0x00000001; 
else R[n]&=OxFFFFFFFE; 
PC+=2; 

} 


Examples: 


ROTL RO Before execution RO = H'80000000, T = 0 
After execution RO = H'00000001, T = 1 
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6.49 ROTR (Rotate Right): Shift Instruction 


Format Abstract Code State T Bit 
ROTR Rn LSB — Rn > T 0100nnnn00000101 1 LSB 


Description: Rotates the contents of general register Rn to the right by one bit, and stores the 
result in Rn (figure 6.6). The bit that is shifted out of the operand is transferred to the T bit. 





Figure 6.6 Rotate Right 


Operation: 


ROTR(long n) /* ROTR Rn */ 

{ 
if ((R[{n]&0x00000001)==0) T=0; 
else T=1; 
R({n]>>=1; 
if (T==1) R[n] |=0x80000000; 
else R[n] &=OxX7FFFFFFF; 
PC+=2; 


} 
Examples: 


ROTR RO Before execution RO = H'00000001, T = 0 
After executionRO = H'80000000, T = 1 
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6.50 RTE (Return from Exception): System Control Instruction 


Class: Delayed branch instruction 


Format _ Abstract Code State T Bit 





RTE Stack area — PC/SR 0000000000101011 4 LSB 





Description: Returns from an interrupt routine. The PC and SR values are restored from the stack, 
and the program continues from the address specified by the restored PC value. 


Note: Since this is a delayed branch instruction, the instruction after this RTE is executed before 
branching. No address errors and interrupts are accepted between this instruction and the next 
instruction. If the next instruction is a branch instruction, it is acknowledged as an illegal slot 
instruction. 


Operation: 


RTE() /* RTE */ 
{ 


unsigned long temp; 


temp=PC ; 

PC=Read_Long(R[15] )+4; 
R[15]+=4; 

SR=Read_Long(R[15] )&0x000003F3; 
R[15)+=4; 

Delay_Slot(temp+t2) ; 


} 
Example: 

RTE Returns to the original routine 
ADD #8,R14 Executes ADD before branching 


Note: With delayed branching, branching occurs after execution of the slot instruction. 
However, instructions such as register changes etc. are executed in the order of delayed 
branch instruction, then delay slot instruction. For example, even if the register in which the 
branch destination address has been loaded is changed by the delay slot instruction, the 
branch will still be made using the value of the register prior to the change as the branch 
destination address. — 
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6.51 RTS (Return from Subroutine): Branch Instruction — 


Class: Delayed branch instruction 


Format | Abstract _ Code State T Bit 
RTS PR —> PC 0000000000001011 2 — 


Description: Returns from a subroutine procedure. The PC values are restored from the PR, and 
the program continues from the address specified by the restored PC value. This instruction is used 
to return to the program from a subroutine program called by a BSR or JSR instruction. 


Note: Since this is a delayed branch instruction, the instruction after this RTS is executed before 
branching. No address errors and interrupts are accepted between this instruction and the next 
instruction. If the next instruction is a branch instruction, it is acknowledged as an illegal slot 
instruction. 


Operation: 


RTS() /* RTS */ 
{ 


unsigned long temp; 


temp=PC ; 
PC=PR+4 ; 
Delay_Slot(temp+2) ; 
} 
Example: 
MOV.L  TABLE,R3 R3 = Address of TRGET 
JSR @R3 Branches to TRGET 
NOP Executes NOP before JSR 
ADD RO,R1 < Return address for when the subroutine procedure is 
completed (PR data) 
TABLE: .data.1 TRGET Jump table 
TRGET: MOV R1,R0 < Procedure entrance 
RTS PRdata—>PC 
MOV #12,R0 Executes MOV before branching 


Note: With delayed branching, branching occurs after execution of the slot instruction. 
However, instructions such as register changes etc. are executed in the order of delayed 
branch instruction, then delay slot instruction. For example, even if the register in which the 
branch destination address has been loaded is changed by the delay slot instruction, the 
branch will still be made using the value of the register prior to the change as the branch 
destination address. 
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6.52 SETT (Set T Bit): System Control Instruction | ae i 
Format Abstract Code State TBit 
SETT 1—>T , 0000000000011000 1 1 





_ Description: Sets the T bit to 1. 
Operation: 


SETT() /* SETT */ 
{ 


T=1; 
—PC+=2; 
} 
Example: 


SETT Before execution T=0O 
After execution T= 1 
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6.53 SHAL (Shift Arithmetic Left): Shift Instruction 


Format Abstract Code State T Bit 
SHAL Rn Te RneO 0100nnnn00100000 1 MSB 


Description: Arithmetically shifts the contents of general register Rn to the left by one bit, and 
stores the result in Rn. The bit that is shifted out of the operand is transferred to the T bit 
(figure 6.7). 





Figure 6.7 Shift Arithmetic Left 


Operation: 


SHAL(long n) /* SHAL Rn (Same as SHLL) */ 


{ 
if ((R[n]&0x80000000)==0) T=0; 


else T=1; 
R[n]<<=1; 
PCt+=2; 
} 
Example: 


SHAL RO Before execution RO=H'80000001, T=0 
After execution RO = H'00000002, T = 1 
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6.54 SHAR (Shift Arithmetic Right): Shift Instruction 
Format | Abstract . * Code oc State T Bit 
SHAR Rn MSB > Rn > T. 0100nnnn00100001 1 LSB 


Description: Arithmetically shifts the contents of general register Rn to the right by one bit, and 
stores the result in Rn. The bit that is shifted out of the operand is transferred to the T bit (figure 
6.8). 





Figure 6.8 Shift Arithmetic Right 


Operation: 


SHAR(long n) /* SHAR Rn */ 
{ 
long temp; 


if ((R{n]&0x00000001)==0) T=0; 
else T=1; 
if ((R[n]&0x80000000)==0) temp=0; 
else temp=1; 
R[n]>>=1; 
if (temp==1) R[n]]=0x80000000; 
else R[n]&=0x7FFFFFFF; 7 
PC+=2; 

} 


Example: 


SHAR ~~ RO Before execution RO = H'80000001, T = 0 
After execution RO = H'C0000000, T = 1 
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6.55 SHLL (Shift Logical Left): Shift Instruction 


Format Abstract Code State T Bit 
SHLL Rn TeRn<O 0100nnnn00000000 1 MSB 


Description: Logically shifts the contents of general register Rn to the left by one bit, and stores 
the result in Rn. The bit that is shifted out of the operand is transferred to the T bit (figure 6.9). 





Figure 6.9 Shift Logical Left 


Operation: 


SHLL(long n) /* SHLL Rn (Same as SHAL) */ 


{ 
if ((R{n]&0x80000000)==0) T=0; 


else T=1; 
R[{n]<<=1; 
PC+=2; 

} 

Examples: 


SHLL RO Before execution RO = H'80000001, T = 0 
After execution RO = H'00000002, T = 1 
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6.56 SHLLn (Shift Logical Left n Bits): Shift Instruction 





Format Be Abstract . Code oo | State T Bit. 
SHLL2 Rn  Rn<<2—4Rn | ~ 100nnnn00001000 1 — 
SHLL8 Rn Rn<<8—>Rn a 0100nnnn00011000 1-0 — 





SHLL16 Rn Rn << 16 — Rn 0100nnnn00101000 1 — 


Description: Logically shifts the contents of general register Rn to the left by 2, 8, or 16 bits, and 
stores the result in Rn. Bits that are shifted out of the operand are not stored (figure 6.10). 


SHLL16 





Figure 6.10 Shift Logical Left n Bits 


Operation: 


SHLL2(long n) /* SHLL2 Rn */ 


R{n]<<=2; 
PC+=2; 
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SHLL8(long n) /* SHLL8 Rn */ 
{ 

R[{n]<<=8; 

PCt+=2; 
} 


SHLL16(long n) /* SHLL16 Rn */ 
{ 


R[{n]<<=16; 
PC+=2; 
} 
Examples: 
SHLL2 RO Before execution RO = H'12345678 
After execution RO = H'48D159E0 
SHLL8 RO Before execution RO = H'12345678 
After execution RO = H'34567800 
SHLL16 RO Before execution RO = H'12345678 
After execution RO = H'56780000 
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6.57 SHLR (Shift Logical Right): Shift Instruction 
Format Abstract Code State TBit 
SHLR Rn O>Rn—>T | 0100nnnn00000001 1 LSB 


Description: Logically shifts the contents of general register Rn to the right by one bit, and stores 
the result in Rn. The bit that is shifted out of the operand is transferred to the T bit (figure 6.11). 





Figure 6.11 Shift Logical Right 


Operation: 


SHLR(long n) /* SHLR Rn */ 
{ 
if ((R[{n]&0x00000001)==0) T=0; 


else T=1; 
R[n]>>=1; 
R[n] &=Ox7FFFFFFF; 
PC+=2; 
} 
Examples 
SHLR RO Before execution RO = H'80000001, T = 0 


After execution _ RO = H'40000000, T = 1 
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6.58 SHLRn (Shift Logical Right n Bits): Shift Instruction 





Format Abstract Code State T Bit 
SHLR2 = Rn Rn>>2 — Rn 0100nnnn00001001 1 — 
SHLR8 Rn Rn>>8 — Rn 0100nnnn00011001 1 — 
SHLR16 Rn Rn>>16 — Rn 0100nnnn00101001 1 — 





Description: Logically shifts the contents of general register Rn to the right by 2, 8, or 16 bits, 
and stores the result in Rn. Bits that are shifted out of the operand are not stored (figure 6.12). 


SHLR16 





Figure 6.12 Shift Logical Right n Bits 
Operation: 


SHLR2(long n) /* SHLR2 Rn */ 
{ 

R[n]>>=2; 

R[n] &=Ox3FFFFFFF; 

PC+=2; 
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SHLR8(long n) /* SHLR8 Rn */ 


{ 


| R[n]>>=8; 


R[n] &=0x00FFFFFF; 


~PC+=2; 
} 


{ 


~R[n])>>=16; 


SHLR16(long n) /* SHLR16 Rn */ 


R[n] &=Ox0000FFFF; 


PC+=2; 
} 

Examples: 
SHLR2 RO 
SHLR8 RO 
SHLR16  ~—RO 


Before execution 
After execution 
Before execution 


After execution 


Before execution 


After execution 


RO = H'12345678 
RO = H'048D159E 
RO = H'12345678 
RO = H'00123456 
RO = H'12345678 
RO = H'00001234 
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6.59 SLEEP (Sleep): System Control Instruction 


Format Abstract Code State T Bit 
SLEEP | Sleep 0000000000011011 3 — 


Description: Sets the CPU into power-down mode. In power-down mode, instruction execution 
stops, but the CPU module state is maintained, and the CPU waits for an interrupt request. If an 
interrupt is requested, the CPU exits the power-down mode and begins exception processing. 


Note: The number of cycles given is for the transition to sleep mode. 
Operation: 
SLEEP() /* SLEEP */ 
{ 
Wait_for_exception ; 
} 
Example: 


SLEEP __ Transits power-down mode 
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6.60 STC (Store Control Register): System Control Instruction 


Class: Interrupt disabled instruction — 


Format , Abstract  . Code : State T Bit 





STC SR, Rn SR > Rn | 0000nnnn00000010 1 — 
STC GBR, Rn GBR — Rn ae 0000nnnn00010010 1 — 
STC VBR, Rn VBR — Rn 0000nnnn00100010 1 — 
STC.L SR,@-Rn Rn —4— Rn, SR > (Rn) 0100nnnn00000011 2 — 
STC.L GBR, @-Rn Rn—4-— Rn, GBR—- (Rn) —‘0100nnnn00010011 2 — 
STC.L VBR, @-Rn Rn — 4 — Rn, VBR => (Rn) 0100nnnn00100011 2 — 


Description: Stores control registers SR, GBR, or VBR data into a specified destination. 


Note: No interrupts are accepted between this instruction and the next instruction. Address errors 
are accepted. 


Operation: 


STCSR(long n) /* STC SR,Rn */ 
{ 

R[n]=SR; 

PC+=2; 
} 


STCGBR(long n) /* STC GBR,Rn */ 
{ 

R[n]=GBR; 

PC+=2; 
} 


STCVBR(long n) /* STC VBR,Rn */ 


{ 
R[n]=VBR; 
PC+=2; 
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STCMSR(long n) /* STC.L SR,@-Rn */ 
{ 

R[n]-=4; 

Write_Long(R[n],SR); 

PC+=2; 
} 


STCMGBR(long n) /* STC.L GBR,@-Rn */ 
{ 

R[n]-=4; 

Write_Long(R[n],GBR) ; 

PC+=2; 
} 


STCMVBR(long n) 7* STC.L VBR,@-Rn */ 
{ 


R[n]-=4; 
Write_Long(R[n],VBR) ; 
PC+=2; 
} 
Examples 
STC SR, RO Before execution RO = H'FFFFFFFF, SR = H'00000000 
After execution RO = H'00000000 
STC.L GBR,@-R15 Before execution R15 = H'10000004 
After execution R15 = H'10000000, @R15 = GBR 


133 


6.61 STS (Store System Register): System Control Instruction 


Class: Interrupt disabled instruction 


Format Abstract Code State T Bit 
STS MACH, Rn MACH -> Rn 0000nnnn00001010 1 — 
STS MACL, Rn MACL — Rn 0000nnnn00011010 1 — 
STS PR, Rn PR — Rn 0000nnnn00101010 1 — 
STS.L MACH,@Rn Rn—4-»Rn,MACH—> (Rn) 0100nnnn00000010 ~=— 1 = 
STS.L MACL,@-Rn Rn—4-— Rn, MACL - (Rn) 0100nnnn00010010 1 — 
STS.L PR, @-Rn Rn — 4 > Rn, PR => (Rn) 0100nnnn00100010 1 — 


Description: Stores system registers MACH, MACL and PR data into a specified destination. 


Note: No interrupts are accepted between this instruction and the next instruction. Address errors 
are accepted. | 


If the system register is MACH in the SH-1 series, the value of bit 9 is transferred to and stored in 
the higher 22 bits (bits 31 to 10) of the destination. With the SH-2 series, the 32 bits of MACH are 
stored directly. 


Operation: 


STSMACH (long n) 7/* STS .MACH,Rn */ 
{ 
R[n]=MACH; 
= naa scsi tdi dans atineess 
R[n] &=0x000003FF; i needed for SH-2 CPU) 
else R[n]|=0xFFFFFCO0; 


STSMACL(long n) /* STS MACL,Rn */ 
1 

R[{n]=MACL; 

PC+=2; 
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STSPR(long n) 


{ 


R[n]=PR; 
PC+=2; 


} 


STSMMACH (long n) 


/* STS PR,Rn */ 


/* STS.L MACH, @-Rn */ 


if ((MACH&0x00000200 )==0) 
Write_Long(R[n] ,MACH&0x000003FF) ; 


else Write_Long 


(R[n] , MACH | OxFFFFFCO0O) 


OOCOOSA SS EOAEEHEDOSETCOSTOAHOREOOTUOCEEEOCRROSEROSOHEREReEEEeeenosonaesoesaeet 


For SH-1 CPU 


/* STS.L MACL, @-Rn */ 


Write_Long(R[n] ,MACL) ; 


STSMPR(long n) /* STS.L PR, @-Rn */ 


Write_Long(R[n],PR); 


PC+=2; 
} 
STSMMACL (long n) 
{ 
R[n]-=4; 
PCt+=2; 
} 
{ 
R[n]-—=4; 
PC+=2; 
Example: 
STS MACH , RO 
STS.L PR, @—-R15 


Before execution 


After execution 


Before execution 


After execution 


RO = H'FFFFFFFF, MACH = H'00000000 
RO = H'00000000 


R15 = H'10000004 
R15 = H'10000000, @R15 = PR 
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6.62 SUB (Subtract Binary): Arithmetic Instruction 


Format Abstract Code State T Bit 
SUB Rm, Rn ~Rn-—-Rm-—- Rn 0011nnnnmmmm1000 1 — 





Description: Subtracts general register Rm data from Rn data, and stores the result in Rn. To 
subtract immediate data, use ADD #imm,Rn. 


Operation: 


SUB(long m,long n) /* SUB Rm,Rn */ 
{ 


R[n] -=R[m] ; 
PCt+=2; 
} 
Example: 


SUB RO,R1 Before execution RO =H'00000001, R1 = H'80000000 
After execution R1 = H'7FFFFFFF 
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6.63 SUBC (Subtract with Carry): Arithmetic Instruction 


Format Abstract Code State T Bit 
SUBC Rm, Rn Rn — Rm—T — Rn, Borrow > T 0011 nnnnmmmm1010 1 Borrow 





Description: Subtracts Rm data and the T bit value from general register Rn, and stores the result 
in Rn. The T bit changes according to the result. This instruction is used for subtraction of data 
that has more than 32 bits. 


Operation: 


SUBC(long m,long n) /* SUBC Rm,Rn */ 
{ 
unsigned long tmp0,tmp1; 


tmp1=R[n]-R{m] ; 
tmp0=R[n] ; 
R[n]=tmp1-T; 

if (tmp0<tmpl) T=1; 
else T=0; 


if (tmp1<R[n]) T=1; 


PCt+=2; 
} 
Examples: 

CLRT RO:R1(64 bits) — R2:R3(64 bits) = RO:R1(64 bits) 

SUBC R3,R1 ~~ Before execution T = 0, R1I = H'00000000, R3 = H'00000001 
After execution T = 1, Rl = H'FFFFFFFF 

SUBC  R2,RO Before execution T = 1, RO = H'00000000, R2 = H'00000000 
After execution T = 1, RO = H'FFFFFFFF 
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6.64 SUBV (Subtract with V _— Underflow Check): Arithmetic | 


Instruction — 7 
Format — Abstract Code = s—“‘i~‘s*C‘SS tatto@’:—SsCT Bit 
-SUBV  Rm,Rn Rn—Rm-»Rn, Underflow—T 0011nnnnmmmm1011 1  — Underflow 


Description: Subtracts Rm data from general register Rn data, and stores the result in Rn. If an 
underflow occurs, the T bit is set to 1. 


Operation: 


SUBV(long m,long n) /* SUBV Rm,Rn */ 
{ 


long dest,src,ans; 


if ((long)R[n]>=0) dest=0; 
else dest=1; 
Lt ((1ong)R[m]>=0) src=0; 
else src=1; 
srct=dest; 
R[n]-=R[m] ; 
if ((long)R[n]>=0) ans=0; 
else ans=1; 
ans+=dest ; 
if (src==1) { 
if (ans==1) T=1; 


else T=0; 
} 
else T=0; 
PC+=2; 
} 
Examples: 


SUBV RO,R1 Before execution RO = H'00000002, R1 = H'80000001 
After execution R1 = H'7FFFFFFF, T = 1 


SUBV R2,R3  Beforeexecution | R2=H'EFFFEFFE, R3 = H'7EFFFFFE 
After execution R3 = H'80000000, T = 1 | 
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6.65 SWAP (Swap Register Halves): Data Transfer Instruction 


Format Abstract | Code State T Bit 

SWAP.B Rm,Rn Rm — Swap upper and lower 0110nnnnmmmm1000 1 — 
halves of lower 2 bytes > Rn 

SWAP.W Rm,Rn Rm — Swap upper and lower 0110nnnnmmmm1001 1 _ 
word — Rn 


Description: Swaps the upper and lower bytes of the general register Rm data, and stores the 
result in Rn. If a byte is specified, bits 0 to 7 of Rm are swapped for bits 8 to 15. The upper 16 bits 
of Rm are transferred to the upper 16 bits of Rn. If a word is specified, bits 0 to 15 of Rm are 
swapped for bits 16 to 31. 


Operation: 


SWAPB(long m,long n) /* SWAP.B Rm,Rn */ 


{ 
unsigned long temp0,temp1; 


temp0=R[m]&0OxfffFf0000; 
temp1=(R[m] &0x000000fF)<<8; 
R[n]=(R[m]>>8) &Ox000000Ff £; 
R[n]=R[n] | temp1| tempo; 
PC+=2; 

} 


SWAPW(long m,long n) /* SWAP.W Rm,Rn */ 
{ 
unsigned long temp; 
temp=(R[m]>>16) &OxOO00FFFF; 
R[{n]=R[m]<<16; 


R[{n] |=temp; 
PC+=2; 
} 
Examples 


SWAP.B RO,R1 Before execution RO=H'12345678 
After execution R1 = H'12347856 


SWAP.W RO,R1 Before execution RO =H'l234567 8 
After execution R1 = H'56781234 
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6.66 TAS (Test and Set): Logic Operation Instruction 


Format _ Abstract | | Code State T Bit 
TAS.B @Rn When(Rn)is0O,1—37,1—-MSBof(Rn) 0100nnnn00011011 4 Test 
results 


Description: Reads byte data from the address specified by general register Rn, and sets the T bit 
to | if the data is 0, or clears the T bit to 0 if the data is not 0. Then, data bit 7 is set to 1, and the 
data is written to the address specified by Rn. During this operation, the bus is not released. 


Operation: 


TAS(long n) /* TAS.B @Rn */ 


{ 
long temp; 


temp=(long)Read_Byte(R[n]); /* Bus Lock enable */ 
if (temp==0) T=1; 

else T=0; 

temp | =0x00000080; 

Write_Byte(R[n],temp); /* Bus Lock disable */ 


PC+=2; 
} 
Example: 
_LOOP TAS.B_ @R7 R7 = 1000 


BF _ LOOP Loops until data in address 1000 is 0 
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6.67 TRAPA (Trap Always): System Control Instruction 


Format Abstract Code State T Bit 
TRAPA #imm PC/SR — Stack area, (immx4+  11000011iiiiiiii 8 — 
VBR) — PC 


Description: Starts the trap exception processing. The PC and SR values are stored on the stack, 
and the program branches to an address specified by the vector. The vector is a memory address 
obtained by zero-extending the 8-bit immediate data and then quadrupling it. The PC points the 
starting address of the next instruction. TRAPA and RTE are both used for system calls. 


Operation: 


TRAPA(long i) /* TRAPA #imm */ 
{ 


long imn; 


imm=(OxO000000FF & i); 


R[15]-=4; 
Write_Long(R[15],SR); 
R[15]-=4; 


Write_Long(R[15],PC-2); 
PC=Read_Long (VBR+( imm<<2) )+4; 


} 


Example: 
Address 
VBR+H ' 80 .data.l 10000000 
TRAPA  #H'20 Branches to an address specified by data in address VBR + 
H'80 
TST #0,RO < Return address from the trap routine (stacked PC value) 
100000000 XOR RO, RO <- Trap routine entrance 
100000002 —s RTE Returns to the TST instruction 
100000004 NOP Executes NOP before RTE 
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6.68 TST (Test Logical): Logic Operation Instruction | 


Format == Abstract © Code te State TBit | 

TST  Rm,Rn 7 Rn &Rm, when resultis 0010nnnnmmmm1000 1 Test 
| 0,1T | | a results 

TST  #imm, RO | RO &imm, when result is 11001000iiiiiiii 1 Test 
0,15 T results 

TST.B #imm, @(RO,GBR) (RO + GBR) & imm, when 11001100iiiiiiii o ag ~ Test 
, ~ - regultis 0, 1 — T ae | | -- results | 


Description: Logically ANDs the contents of general registers Rn and Rm, and sets the T bit to 1 
if the result is 0 or clears the T bit to 0 if the result is not 0. The Rn data does not change. The 
contents of general register RO can also be ANDed with zero-extended 8-bit immediate data, or the 
contents of 8-bit memory accessed by indirect indexed GBR addressing can be ANDed with 8-bit 
immediate data. The RO and memory data do not change. 


Operation: 


TST(long m,long n)/* TST Rm,Rn */ 
{ 

if ((R[n]&R[m] )==0) T=1; 

else T=0; 

PC+=2 ; 
} 


TSTI(long i) /* TEST #imm,RO */ 
{ 
long temp; 


temp=R[0]&(OxO00000FF & (long)1i); 
if. (temp==0) T=1; 
else T=0; 
PC+=2; 
} 


TSTM(long i) /* TST.B #imm,@(RO,GBR) */ 
{ 
long temp; 
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temp=(long)Read_Byte(GBR+tR[0]); 
temp&=(0x000000FF & (long)i); 
if (temp==0) T=1; 


else T=0; 
PC+=2; 
} 
Examples: 
TST RO, RO Before execution RO =H'00000000 
After execution T=1 
TST #H'80,RO Before execution RO = H'FFFFFF7F 
After execution T=] 


TST.B  #H'A5,@(RO,GBR)  Beforeexecution @(RO,GBR) =H'A5S 


After execution T=0 
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6.69 XOR (Exclusive OR Logical): Logic Operation Instruction 





Format Abstract | Code | State T Bit 
XOR  Rm,Rn Rn 4Rm— Rn 0010nnnnmmmm1010 1 _— 
XOR  #imm,RO RO “imm — RO 11001010iiiiiiii 861 — 
XOR.B #imm,@(RO,GBR) (RO+GBR)4imm-— (RO 11001110iiiiiiii 3 me 

+ GBR) 


Description: Exclusive ORs the contents of general registers Rn and Rm, and stores the result in 
Rn. The contents of general register RO can also be exclusive ORed with zero-extended 8-bit 


immediate data, or 8-bit memory accessed by indirect indexed GBR addressing can be exclusive 
ORed with 8-bit immediate data. 


Operation: 


XOR(long m,long n)/* XOR Rm,Rn */ 
{ 
— R[n]*=Rim] ; 
PC+=2; 
} 


XORI(long i) /* XOR #imm,RO */ 
{ 
R[0]*=(Ox000000FF & (long)i); 
PC+=2; 
} 


XORM(long i) /* XOR.B #imm,@(RO,GBR) */ 
{ 
long temp; 


temp=(long)Read_Byte(GBR+tR[0] ); 
temp*=(Ox000000FF & (long)i); 
Write_Byte(GBR+R[0],temp) ; 
PC+=2; 
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Examples: 


XOR RO,R1 


XOR #H'FO,RO 


XOR.B #H'A5,@(RO,GBR) 


Before execution 


After execution 


Before execution 


After execution 


Before execution 


After execution 
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RO = H'AAAAAAAA, R1 = H'55555555 
R1 = H'FFFFFFFF 


RO = H'FFFFFFFF 
RO = H'FFFFFFOF 


@(RO,GBR) = H'A5 
@(RO,GBR) = H'00 


6.70 XTRCT (Extract): Data Transfer Instruction 





Format _ _ Abstract | 3 Code State TBit 
XTRCT Rm,Rn Center 32 bits of Rm and Rn > 0010nnnnmmmm1101 1 — 
Rn 


Description: Extracts the middle 32 bits from the 64 bits of general registers Rm and Rn, anes 
stores the 32 bits in Rn neue 6. 13). 


LSB MSB 





Figure 6.13 Extract 
Operation: 


XTRCT( long m,long n) /* XTRCT Rm,Rn */ 
{ 


unsigned long temp; 


temp=(R[m]<<16) &OxFFFFO000; 
R({n]=(R[n]>>16) &OxXOOOOFFFF; 
R[n] |=temp; 
PC+=2; 

} 


Example: 


XTRCT. RO,R1 Beforeexecution RO =H'01234567, Rl = H'89ABCDEF 
After execution R1 = H'456789AB 
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Section 7 Pipeline Operation 


This section describes the operation of the pipelines for each instruction. This information is 
_ provided to allow calculation of the required number of CPU instruction execution states (system 
clock cycles). 


7.1 Basic Configuration of Pipelines 


Pipelines are composed of the following five stages: 


¢ IF (Instruction fetch) Fetches an instruction from the memory in which the program is _ 
stored. 


ID (Instruction decode) Decodes the instruction fetched. 


e EX (Instruction execution) Performs data operations and address calculations according to the 
results of decoding. 


¢ MA (Memory access) Accesses data in memory. Generated by instructions that involve 
memory access, with some exceptions. 

¢ WB (Write back) Returns the results of the memory access (data) to a register. 
Generated by instructions that involve memory loads, with some 
exceptions. 


As shown in figure 7.1, these stages flow with the execution of the instructions and thereby 
constitute a pipeline. At a given instant, five instructions are being executed simultaneously. All 
instructions have at least 3 stages: IF, ID, and EX. Most, but not all, have stages MA and WB as 
well. The way the pipeline flows also varies with the type of instruction. The basic pipeline flow is 
as shown in figure 7.1; some pipelines differ, however, because of contention between IF and MA. 
In figure 7.1, the period in which a single stage is operating is called a slot. 


> + > +} > YY OY | LY @& : Siot 


Instruction 1 IF ID EX MA WB a 


Instruction 2 EX MA WB stream 


Instruction 3 ID EX MA WB 


Instruction 4 IF ID EX MA WB 
Instruction 5 EX MA WB 
Instruction 6 ID EX MA WB 





Figure 7.1 Basic Structure of Pipeline Flow 
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7.2 Slot and Pipeline Flow 


The time period in which a single stage operates is called a slot. Slots must follow the rules 
described below. 


7.2.1 Instruction Execution 


Each stage (IF, ID, EX, MA, and WB) of an instruction must be executed in one slot. Two or more 
stages cannot be executed within one slot (figure 7.2), with exception of WB and MA. Since WB 
is executed immediately after MA, however, some instructions may execute MA and WB within 
the same slot. 


+> + - > YD OD OY OH LY @- :: Slot 
Instruction 1 iF ID EX MA WB 


Instruction 2 IF ID EX MA WB 


Note: ID and EX of instruction 1 are being executed in the same slot. 





Figure 7.2 Impossible Pipeline Flow 1 


7.2.2 Slot Sharing 


A maximum of one stage from another instruction may be set per slot, and that stage must be — 
different from the stage of the first instruction. Identical stages from two different instructions may 
never be executed within the same slot (figure 7.3). 


+ <> > < & @ : Slo 
Instruction 1 WB 
Instruction 2 | WB 
Instruction 3 MA WB 
Instruction 4 EX MA WB 
Instruction 5 EX MA WB 





Note: Same stage of another instruction is being executed in same slot. 


Figure 7.3 Impossible Pipeline Flow 2 
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7.2.3 Slot Length 


The number of states (system clock cycles) S for the execution of one slot is calculated with the 
following conditions: 


¢ S =(the cycles of the stage with the highest number of cycles of all instruction stages contained 
in the slot) 


This means that the instruction with the longest stage stalls others with shorter stages. 
¢ The number of execution cycles for each stage: 


— IF The number of memory access cycles for instruction fetch 


— ID Always one cycle 
— EX Always one cycle 
— MA The number of memory access cycles for data access 
— WB Always one cycle 


As an example, figure 7.4 shows the flow of a pipeline in which the IF (memory access for 
instruction fetch) of instructions 1 and 2 are two cycles, the MA (memory access for data access) 
of instruction 1 is three cycles and all others are one cycle. The dashes indicate the instruction is 
being stalled. 


+> +> + SO + : Slo 
(2) (2) (1) (3) (1) (1) Number of 


Instruction’ IF IF 1D — EX MA MA MA WB 2 


Instruction 2 IF IF ID EX — — MA WB 





Figure 7.4 Slots Requiring Multiple Cycles 
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7.3 Number of Instruction Execution States 


The number of instruction execution states is counted as the interval between execution of EX 
stages. The number of states between the start of the EX stage for instruction 1 and the start of the 
EX stage for the following instruction (instruction 2) is the execution time for instruction 1. 


For example, in a pipeline flow like that shown in figure 7.5, the EX stage interval between 
instructions 1 and 2 is five cycles, so the execution time for instruction 1 is five cycles. Since the 
interval between EX stages for instructions 2 and 3 is one state, the execution time of instruction 2 
is one state. 


If a program ends with instruction 3, the execution time for instruction 3 should be calculated as 
the interval between the EX stage of instruction 3 and the EX stage of a hypothetical instruction 4, 
using an MOV Rn, Rn that follows instruction 3. (In the case of figure 7.5, the execution time of 
instruction 3 would thus be one cycle.) In this example, the MA of instruction 1 and the IF of 
instruction 4 are in contention. For operation during the contention between the MA and IF, see 
section 7.4, Contention Between Instruction Fetch (IF) and Memory Access (MA). The execution 
time between instructions | and 3 in figure 7.5 is seven states (5+ 1+ 1). © 


<> + # :: Sit 
(4) (1) (1) 
Instruction 1 MA MA MA WB 


Instruction 2 —_— — 
Instruction3 = 82 <= AD MA 
(Instruction 4: | IF ID [EX]) © 





Figure 7.5 How Instruction Execution States Are Counted 
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7.4 Contention Between Instruction Fetch (IF) and Memory Access (MA) 


7.4.1 Basic Operation When IF and MA are in Contention 


The IF and MA stages both access memory, so they cannot operate simultaneously. When the IF 
and MA stages both try to access memory within the same slot, the slot splits as shown in figure 
7.6. When there is a WB, it is executed immediately after the MA ends. | 


A B CC OD E F G 
> +? or > + OY LO |] @ : Slot 


Instruction 1 IF ID EX WB MA of instruction 1 and IF of instruction 4 

Instruction 2 IF ID EX WB contend at D 

Instruction 3 IF ID EX MA of instruction 2 and IF of instruction 5 
contend at E 

Instruction 4 ID EX 

Instruction 5 ID EX 


When MA and IF are in contention, the following occurs: 


A B C OD E F G 
>> +> «+> «<r > @ : Slot 


Instruction! IF ID EX WB Split at D 
Instruction 2 IF — EX WB Split at E 
Instruction 3 ID — EX 

Instruction 4 — ID EX 

Instruction 5 ID EX 





Figure 7.6 Operation When IF and MA Are in Contention 


The slots in which MA and IF contend are split. MA is given priority to execute in the first half 
(when there is a WB, it immediately follows the MA), and the EX, ID, and IF are executed 
simultaneously in the latter half. For example, in figure 7.6 the MA of instruction 1 is executed in 
slot D while the EX of instruction 2, the ID of instruction 3 and IF of instruction 4 are executed 
simultaneously thereafter. In slot E, the MA of instruction 2 is given priority and the EX of 
instruction 3, the ID of instruction 4 and the IF of instruction 5 executed thereafter. 


The number of states for a slot in which MA and IF are in contention is the sum of the number of 
memory access cycles for the MA and the number of memory access cycles for the IF. 
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7.4.2 The Relationship Between IF and the Location of Instructions in On-Chip 
ROM/RAM or On-Chip Memory 


When the instruction is located in the on-chip memory (ROM or RAM) or on-chip cache of the 
SH microcomputer, the SH microcomputer accesses the on-chip memory in 32-bit units. The SH 
microcomputer instructions are all fixed at 16 bits, so basically 2 instructions can be fetched in a 
single IF stage access. | | 


If an instruction is located on a longword boundary, an IF can get two instructions at each 
instruction fetch. The IF of the next instruction does not generate a bus cycle to fetch an 
instruction from memory. Since the next instruction IF also fetches two instructions, the 
instruction IFs after that do not generate a bus cycle either. 


This means that IFs of instructions that are located so they start from the longword boundaries 
within instructions located in on-chip memory (the position when the bottom two bits of the 
instruction address are 00 is Al = O and AO = 0) also fetch two instructions. The IF of the next 
instruction does not generate a bus cycle. IFs that do not generate bus cycles are written in lower 
case as ‘if’. These ‘if’s always take one state. 


When branching results in a fetch from an instruction located so it starts from the word boundaries 
(the position when the bottom two bits of the instruction address are 10 is Al = 1, AO = 0), the bus 
cycle of the IF fetches only the specified instruction more than one of said instructions. The IF of 
the next instruction thus generates a bus cycle, and fetches two instructions. Figure 7.7 illustrates 
these operations. 
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: > oD DP YP LY LH LH @  : Slo 
Instruc-l| Instruc- |--: Instruction 1 ID EX 
tion 1} tion2 | Instruction 2 if ID EX 
instruc-|| Instruc- | --* Instruction 3 ID EX 
tion3 |} tion4 | Instruction 4 if ID EX 
Instruc- --- Instruction 5 ID EX 
tion 5 Instruction 6 if ID EX 


(On-chip memory 


or on-chip cache) : Bus cycle generated 


if : Nobuscycle 


Fetching from an instruction (instruction 1) located on a longword boundary 


<> <> +> +> +? >? +H > +> @? : Slot 


Instruc- 
i tion 2 || °° Instruction 2 ID EX 


--- Instruction 3 ID EX 
Instruc-}] Instruc- | 
Instruction 4 | ff ID EX 
--+ Instruction 5 ID EX 
Instruc-|/ Instruc- 
tion 6 Instruction 6 iff ID EX 


: Bus cycle generated 
if : Nobuscycle 


Fetching from an instruction (instruction 2) located on a word boundary 





Figure 7.7 Relationship Between IF and Location of Instructions in On-Chip Memory 


7.4.3 Relationship Between Position of Instructions Located in On-Chip ROM/RAM or 
On-Chip Memory and Contention Between IF and MA 


When an instruction is located in on-chip memory (ROM/RAM) or on-chip cache, there are 
instruction fetch stages (‘if’ written in lower case) that do not generate bus cycles as explained in 
section 7.4.2 above. When an if is in contention with an MA, the slot will not split, as it does when 
an IF and an MA are in contention, because ifs and MAs can be executed simultaneously. Such 
slots execute in the number of states the MA requires for memory access, as illustrated in figure 
7.8. 


When programming, avoid contention of MA and IF whenever possible and pair MAs with ifs to 
increase the instruction execution speed. Instructions that have 4 (5)-stage pipelines of IF, ID, EX, 
MA, (WB) prevent stalls when they start from the longword boundaries in on-chip memory (the 
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position when the bottom 2 bits of instruction address are 00 is Al = 0 and AO = 0) because the 
MA of the instruction falls in the same slot as ifs that follow. 


instruc- nee Instruction | 
tion 1 Instruction 2 if 
Instruc-|] Instruc- | **: Instruction 3 
tion 3 || tion 4 Instruction 4 


Instruc-|] Instruc- |... Instruction 5 : ID EX 
tion 5 || tion 6 Instruction 6 if ID EX 


(On-chip memory ; : Splits 
or on-chip cache) Baris 


‘if: : Does not split 


weoww d 


MA in slot A is in contention with an if, so no split occurs. 
MA in slot B is in contention with an IF, so it splits. 





Figure 7.8 Relationship Between the Location of Instructions in On-Chip Memory and 
Contention Between IF and MA 


7.5 Effects of Memory Load Instructions on Pipelines 


Instructions that involve loading from memory return data to the destination register during the 
WB stage that comes at the end of the pipeline. The WB stage of such a load instruction (load 
instruction 1) will thus come after the EX stage of the instruction that immediately follows it 
(instruction 2). . 


When instruction 2 uses the same destination register as load instruction 1, the contents of that 
register will not be ready, so any slot containing the MA of instruction 1 and EX of instruction 2 
will split. The destination register of load instruction 1 is the same as the destination (not the _ 
source) of instruction 2, so it splits. a 


When the destination of load instruction 1 is the status register (SR) and the flag in it is fetched by 
instruction 2 (as ADDC does), a split occurs. No split occurs, however, in the following cases: 


¢ When instruction 2 is a load instruction and its destination is the same as that of load 
instruction 1. | 


¢ When instruction 2 is Mac @Rm+ , @Rn+, and the destination of load instruction 1 are the 
same. 
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The number of states in the slot generated by the split is the number of MA cycles plus the number 
of IF (or if) cycles, as illustrated in figure 7.9. This means the execution speed will be lowered if 
the instruction that will use the results of the load instruction is placed immediately after the load 
instruction. The instruction that uses the result of the load instruction will not slow down the 
program if placed one or more instructions after the load instruction. 


<> <> +> + > +} +: Slot 
Load instruction 1 (MOV.W @RO,R1) IF ID EX WB 


Instruction 2 (ADD R1, R2) IF 
Instruction 3 
Instruction 4 





Figure 7.9 Effects of Memory Load Instructions on the Pipeline 


7.6 Programming Guide 
To improve instruction execution speed, consider the following when programming: 
¢ To prevent contention between MA and IF, locate instructions that have MA stages so they start 


from the longword boundaries of on-chip memory (the position when the bottom two bits of the 
instruction address are 00 is Al = 0 and AO = 0) wherever possible. 


e The instruction that immediately follows an instruction that loads from memory should not use 
the same destination register as the load instruction. 


¢ Locate instructions that use the multiplier nonconsecutively. Also locate nonconsecutively an 
access to the MACH or MACL register for fetching the results from the multiplier and an 
instruction that uses the multiplier. 
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7.7 Operation of Instruction Pipelines 


This section describes the operation of the instruction pipelines. By eoabining these with the rules 
described so far, the way pipelines flow in a program and the number of instruction execution 
states can be calculated. | 


In the following figures, “Instruction A” refers to the instruction being described. When “IF” is 
written in the instruction fetch stage, it may refer to either “IF” or “if”. When there is contention 
between IF and MA, the slot will split, but the manner of the split is not described in the tables, 
with a few exceptions. When a slot has split, see section 7.4, Contention Between Instruction 
Fetch (IF) and Memory Access (MA). Base your response on the rules for pipeline operation given 
there. 


Table 7.1 lists the format for number of instruction stages and execution states: 


Table 7.1 Format for the Number of Stages and Execution States for Instructions 





Type Category Stage State Contention Instruction. 
Functional Instruction Number Number Contention that Corresponding instructions 
types s are of of occurs represented by mnemonic 
catego- stages execu- 
— rized inan __ tion 
basedon __instruc- states 
operations — tion when 
no 
conten- 
tion 
occurs 


Table 7.2. Number of Instruction Stages and Execution States 


Type Category Stage State Contention Instruction 

Data Register- 3 1 — MOV #imm, Rn 

transfer register MOV Rm, Rn 

instructions transfer . 
MOVT Rn 


SWAP.B  Rm,Rn 
SWAP.W Rm,Rn 
XTRCT Rm, Rn 
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Table 7.2 


Type 

Data 
transfer 
instructions 
(cont) 


Number of Instruction Stages and Execution States (cont) 


Category Stage 
5 


Memory 
load 
instructions 


Memory 
store 
instructions 


4 


Contention 
Contention occurs 


if the instruction 
placed 
immediately after 


this one uses the 
same destination 


register 


MA contends with 
IF 


¢ MA contends with 


IF 
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Instruction 

MOV.W @(disp,PC),Rn 
MOV.L @(disp,PC),Rn 
MOV. @Rm, Rn 


MOV. 
MOV. 
MOV. 
MOV. 
MOV. 
MOV. 
MOV. 
MOV. 
MOV. 
MOV. 
MOV. 
MOV. 
MOV. 
MOV. 
MOV. 
MOV. 
MOV. 
MOV. 
MOV. 
MOV. 
MOV. 
MOV. 
MOV. 
MOV. 
MOV. 
MOV. 
MOV. 
MOV. 
MOV. 


rf wr fe wo PP =F wor sw PF f= OJyr fF wer ese wore ef dow Pease ses wD 


@Rm, Rn 

@Rm, Rn 

@Rm+, Rn 

@Rm+, Rn 
@Rm+, Rn 
@(disp,Rm),R0O 
@(disp,Rm),RO 
@(disp,Rm),Rn 
@(RO,Rm),Rn 
@(RO,Rm),Rn 
@(RO,Rm),Rn 
@(disp,GBR),RO 
@(disp,GBR) ,RO 
@(disp, GBR) , RO 
Rm, @Rn 

Rm, @Rn 

Rm, @Rn 

Rm, @—Rn 

Rm, @—Rn 

Rm, @—Rn 

RO, @(disp, Rn) 
RO, @(disp, Rn) 
Rm, @(disp, Rn) 
Rm, @(RO,Rn) 
Rm, @(RO,Rn) 
Rm, @(RO,Rn) 
RO, @(disp,GBR) 
RO, @( disp, GBR) 
RO, @(disp, GBR) 


Table 7.2 Number of Instruction Stages and Execution States (cont) 





Type Category Stage State Contention. -.. Instruction 
Arithmetic Arithmetc 3 91 & &32— | | ADD —-_—sRm, Rn 
instructions instructions ADD #imm, Rn 
between | 
registers 7 ADDC Rm, Rn 
(except | ADDV Rm, Rn 
multiplic- 
ation | : a, CMP/EQ  #imm,R0O 
instruc- : CMP/EQ Rm,Rn 
tions) : , 


CMP/HS = Rm,Rn 
CMP/GE _Rm,Rn 
CMP/HI  Rm,Rn 
CMP/GT — Rm,Rn 


CMP/PZ Rn 
CMP/PL Rn 
CMP/STR Rm,Rn 
DIV1 Rm, Rn 
DIVOS Rm, Rn 
DIVOU 

DT Rn*?3 


EXTS .B Rm, Rn 
EXTS.W = Rm, Rn 
EXTU.B Rm, Rn 
EXTU .W Rm, Rn 


NEG Rm, Rn 
NEGC Rm, Rn 
SUB Rm, Rn 
SUBC Rm, Rn 
SUBV Rm, Rn 
Multiply/ 7/8"! 3/(2)*2 + ~Multiplier contention MAC.W  @Rm+, @Rn+ 
accumulate | occurs when an 
instructions instruction that uses 


the multiplier follows a 
MAC instruction 


| ¢ MA contends with IF 
Notes 1. Inthe SH-2 CPU, multiply/accumulate instructions are 7 stages, multiply instructions 6 


stages; in the SH-1 CPU, multiply/accumulate instructions are 8 stages, multiply 
instructions 7 stages - 


2. The normal minimum number of execution states (The number in parentheses is the 
number of states when there is contention with preceding/following instructions) — 


3. SH-2 CPU instructions 
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Table 7.2. Number of Instruction Stages and Execution States (cont) 


Type Category Stage State Contention Instruction 
Arithmetic Double- S) 3/(2to + Multiplier MAC.L @Rm+, @Rn+*?3 
instructions length 4)*2 contention occurs 
(cont) multiply/ when an 

accumulate instruction that 

instruction uses the multiplier 

(SH-2 CPU follows a MAC 

only) instruction 

¢ MA contends with 
IF 
Multiplic- 6/7"! 1 to 3*2 + Multiplier MULS.W Rm, Rn 
ation contention occurs 


MULU .W Rm, Rn 
instructions when an instruc- 


tion that uses the 
multiplier follows 
a MUL instruction 


¢ MA contends with 


IF 
Double- 9 2to4*2 + Multiplier DMULS.L Rm,Rn*? 
length | contention occurs pyr, Rm, Rn*3 
multiply/ when an y 
accumulate instruction that MUL .L Rm, Rn* 
instruction uses the multiplier 
(SH-2 CPU follows a MAC 
only) instruction 
¢ MA contends with 
IF 
Logic Register- 3 1 — AND Rm, Rn 
operation register AND #4mm, RO 
instructions logic 
operation NOT Rm, Rn 
instructions OR Rm, Rn 
OR #imm, RO 
TST Rm, Rn 
TST #imm, RO 
XOR Rm, Rn 
XOR #imm, RO 


Notes 1. In the SH-2 CPU, multiply/accumulate instructions are 7 stages, multiply instructions 6 
stages; in the SH-1 CPU, multiply/accumulate instructions are 8 stages, multiply 
instructions 7 stages | 

2. The normal minimum number of execution states (The number in parentheses is the 
number of cycles when there is contention with following instructions) 
3. SH-2 CPU instructions 
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Table 7.2 


Type 

Logic 
operation 
instructions 
(cont) 


Shift 
instructions 


Branch 
instructions 


Number of Instruction Stages and Execution States (cont) 


Category Stage 


Memory logic 6 3 
operations 
instructions 


TAS 6 4 
instruction 


Shift 3 1 
instructions 


Conditional 3 3/1*4 
branch 


instructions 


Delayed 3 
conditional 
branch 
instructions 
(SH-2 CPU 

only) 


2/1*4 


Unconditional 3 2 
branch 
instructions 


Notes 3. SH-2 CPU instruction 


4. One state when there is no branch 


State — 


Contention 


-« MAcontends 
with IF 


¢ MA contends 
with IF 
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Instruction 

AND.B  #imm, @(RO,GBR) 
OR.B #imm, @(RO,GBR) 
TST.B  #imm, @(RO,GBR) 
XOR.B  #imm, @(RO,GBR) 
TAS.B  @Rn 

ROTL Rn 

ROTR Rn 

ROTCL kn 

ROTCR Rn 

SHAL Rn 

SHAR Rn 

SHLL Rn 

SHLR Rn 

SHLL2 Rn 

SHLR2 Rn 

SHLL8 Rn 

SHLR8 Rn 

SHLL16 Rn 

SHLR16 Rn 

BF label 

BT label 

BF/S label*? 

BT/S label*? 

BRA label 

BRAF Rm*3 

BSR label 

BSRF Rm*? 

JMP @Rm 

JSR @Rm 

RTS | 


Table 7.2 


Type 


System 
control 


instructions 


Number of Instruction Stages and Execution States (cont) 


Category Stage 


System 
control 
ALU 


instructions 


LDC.L 
instruction 


STC.L 


instructions 


LDS.L 


instructions 


(PR) 


STS.L 
instruction 
(PR) 


3 


5 


4 


5 


4 


Contention 


Contention occurs 
when an 
instruction that 
uses the same 
destination 
register is placed 


immediately after 


this instruction 


MA contends with 
IF 


MA contends with 
IF 


Contention occurs 
when an 
instruction that 
uses the same 
destination 
register is placed 
immediately after 
this instruction 


MA contends with 
IF 


Instruction 

CLRT 

LDC Rm, SR 
LDC Rm, GBR 
LDC Rm, VBR 
LDS Rm, PR 
NOP 

SETT 

STC SR, Rn 
STC GBR, Rn 
STC VBR, Rn 
STS PR, Rn 
LDC. @Rm+, SR 
LDC.L @Rm+,GBR 
LDC. @Rm+,VBR 
STC.L SR, @-Rn 
STC.L GBR, @—Rn 
STC.L VBR, @-Rn 
LDS.L @Rmt+,PR 


MA contends with STS.L 


IF 
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PR ! @—Rn 


Table 7.2 


Type Category Stage State Contention ~ Instruction | 
System Register> 4 1 ¢ Contention occurs CLRMAC 
control MAC with multiplier IDS Rm, MACH 
instructions transfer 5, MAK Genisnds wih : 
(cont) instruction IE | LDS Rm, MACL 
Memory 4 1 ¢ Contention occurs LDS.L  @Rm+,MACH 
MAC with multiplier LDS.L  @Rm+,MACL 
transfer - MAcontends with 
instructions IF 
MAC —> 5 1 ¢ Contention occurs STS MACH, Rn 
register with multiplier srs MACL. Rn 
transfer * Contention occurs 
instruction when an 
instruction that 
uses the same 
destination 
register is placed 
immediately after 
this instruction 
¢ MA contends with 
IF 
MAC —> 4 1 ¢ Contention occurs STS.L MACH, @—Rn 
memory with multiplier STS.L  MACL, @-Rn 
transfer * MA contends with 
instruction IF 
RTE 5 4 — RTE 
instruction 
TRAP 9 8 — TRAPA #imm 
instruction 
SLEEP 3 3 — SLEEP 
instruction 


Number of Instruction Stages and Execution States (cont) 
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7.7.1 Data Transfer Instructions 


Register-Register Transfer Instructions: Include the following instruction types: 


¢ MOV #imm, Rn 

¢« MOV Rm, Rn 

« MOVA @(disp, PC), RO 
¢ MOVT Rn 


e SWAP.B Rm, Rn 
¢ SWAP.W Rm, Rn 
¢ XTRCT Rm, Rn 


> +> +} <> +} =: Slot 
Instruction A IF ID EX 
Next instruction 


Third instruction 





Figure 7.10 Register-Register Transfer Instruction Pipeline 


Operation: The pipeline ends after three stages: IF, ID, and EX. Data is transferred in the EX 
stage via the ALU. 
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Memory Load Instructions: Include the following instruction types: 


- MOV.W @(disp, PC), Rn 


¢ MOV.L @(disp, PC), Rn 

¢ MOV.B @Rm, Rn 

¢ MOV.W @Rm, Rn 

¢ MOV.L @Rm, Rn 

« MOV.B @Rm-+, Rn 

° MOV.W @Rm+, Rn 

¢ MOV.L @Rm-, Rn 

« MOV.B @(disp, Rm), RO 
¢ MOV.W @(disp, Rm), RO — 
¢ MOV.L @(disp, Rm), Rn 

¢ MOV.B @(RO, Rm), Rn 

¢ MOV.W @(RO, Rm), Rn 

¢ MOV.L @(RO, Rm), Rn 

¢ MOV.B @(disp, GBR), RO 
¢ MOV.W @(disp, GBR), RO 


¢ MOV.L @(disp, GBR), RO 


+> + + + > @: Slot 
InstructonA IF ID EX MB WB 


Next instruction , IF 
Third instruction IF ID EX 





Figure 7.11 Memory Load Instruction Pipeline 


Operation: The pipeline has five stages: IF, ID, EX, MA, and WB (figure 7.11). If an instruction 
that uses the same destination register as this instruction is placed immediately after it, contention 
will occur. (See Section 7.5, Effects of Memory Load Instructions on Pipelines.) 
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Memory Store Instructions: Include the following instruction types: 


¢« MOV.B Rm, @Rn 

¢« MOV.W Rm, @Rn 

¢ MOV.L Rm, @Rn 

« MOV.B Rm, @-Rn 

¢« MOV.W Rm, @-Rn 

e MOV.L Rm, @—Rn 

« MOV.B RO, @(disp, Rn) 
¢* MOV.W RO, @(disp, Rn) 
¢ MOV.L Rm, @(disp, Rn) 
¢ MOV.B Rm, @(RO, Rn) 

° MOV.W Rm, @(RO, Rn) 

¢ MOV.L Rm, @(RO, Rn) 

¢ MOV.B RO, @(disp, GBR) 
¢ MOV.W RO, @(disp, GBR) 
¢ MOV.L RO, @(disp, GBR) 


> + + +} & @] : Slot 
InstructionA IF ID EX MA 
Next instruction IF ID EX 


Third instruction IF 





Figure 7.12 Memory Store Instruction Pipeline 


Operation: The pipeline has four stages: IF, ID, EX, and MA (figure 7.12). Data is not returned to 
the register so there is no WB stage. | 
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7.7.2 Arithmetic Instructions 


Arithmetic Instructions between Registers (Except Multiplication Instructions): Include the 
following instruction types: . . = 


« ADD Rm, Rn 
« ADD #imm, Rn 
« ADDC Rm, Rn 
e ADDV Rm, Rn 
¢ CMP/EQ #imm, RO 
¢ CMP/EQ Rm, Rn 
¢ CMP/HS Rm, Rn 
¢ CMP/GE Rm, Rn 
¢ CMP/HI Rm, Rn 
° CMP/GT Rm, Rn 
° CMP/PZ ~— Rn 
¢ CMP/PL Rn 
¢ CMP/STR Rm, Rn 
¢ DIV1 Rm, Rn 
¢ DIVOS Rm, Rn 
¢ DIVOU 
« DT Rn (SH-2 CPU only) 
EXTS.B Rm, Rn 
¢ EXTS.W Rm, Rn 
¢ EXTU.B Rm, Rn 
¢ EXTU.W Rm, Rn 
e NEG Rm, Rn 
¢ NEGC Rm, Rn 
SUB Rm, Rn 
« SUBC Rm, Rn 
¢ SUBV Rm, Rn 
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+> <> <> <> =|} =: Slot 
instruction A IF ID EX MA 


Next instruction 


Third instruction 





Figure 7.13 Pipeline for Arithmetic Instructions between Registers Except Multiplication 
Instructions 


Operation: The pipeline has three stages: IF, ID, and EX (figure 8.13). The data operation is 
completed in the EX stage via the ALU. 
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Multiply/Accumulate Instruction (SH-1 CPU): Includes the following instruction type: 


° MAC.W @Rm+, @Rn+ 


> + + <> + |} +> @: Slot 
MAC.W IF ID EX MA MA mm mm mm 


Next instruction iF — ID EX MA WB 
Third instruction IF ID EX MA WB 





Figure 7.14 Multiply/Accumulate Instruction Pipeline 


Operation: The pipeline has eight stages: IF, ID, EX, MA, MA, mm, mm, and mm (figure 8.14). 
The second MA reads the memory and accesses the multiplier. The mm indicates that the 
multiplier is operating. The mm operates for three cycles after the final MA ends, regardless of 
slot. The ID of the instruction after the MAC.W instruction is stalled for one slot. The two MAs of 
the MAC.W instruction, when they contend with IF, split the slots as described in section 7.4, 
Contention Between Instruction Fetch (IF) and Memory Access (MA). 


When an instruction that does not use the multiplier follows the MAC.W instruction, the MAC.W 
instruction may be considered to be five-stage pipeline instructions of IF, ID, EX, MA, and MA. 
In such cases, the ID of the next instruction simply stalls one slot and thereafter the pipeline 
operates normally. When an instruction that uses the multiplier comes after the MAC.W 
instruction, contention occurs with the multiplier, so operation is not as normal. This occurs in the 
following cases: 


When a MAC.W instruction is located immediately after another MAC.W instruction 
When a MULS.W instruction is located immediately after a MAC.W instruction 

When an STS (register) instruction is located immediately after a MAC.W instruction 
When an STS.L (memory) instruction is located immediately after a MAC.W instruction 
When an LDS (register) instruction is located immediately after a MAC.W instruction 


Au pwnd = 


When an LDS.L (memory) instruction is located immediately after a MAC.W instruction 
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1. When a MAC.W instruction is located immediately after another MAC.W instruction 


When the second MA of a MAC.W instruction contends with an mm generated by a 
preceding multiplier-type instruction, the bus cycle of that MA is extended until the mm ends 
(the M—A shown in the dotted line box below) and that extended MA occupies one slot. 


If one or more instruction not related to the multiplier is located between the MAC.W 
instructions, multiplier contention between MAC instructions does not cause stalls (figure 
7.15). 


IF ID EX MA 


MAC.W 
Third instruction 


<> +> +> @- : Siot 
IF ID EX MA 
Other instruction 





Figure 7.15 Unrelated Instructions between MAC.W Instructions 


Sometimes consecutive MAC.Ws may not have multiplier contention even when MA and IF 
contention causes misalignment of instruction execution. Figure 7.16 illustrates a case of this 
type. This figure assumes MA and IF contention. 





Figure 7.16 Consecutive MAC.Ws without Misalignment 
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When the second MA of the MAC.W instruction is extended until the mm ends, contention 
between MA and IF will split the slot, as usual. Tieuters 7. 17 illustrates a case of this POPE: This 
ne assumes MA and IF contention. | 


Other instruction 
Other instruction 
Other instruction 





Figure 7.17 MA and IF Contention 
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When a MULS.W instructions is located immediately after a MAC.W instruction 


A MULS.W instruction has an MA stage for accessing the multiplier. When the MA of the 
MULS.W instruction contends with an operating MAC instruction multiplier (mm), the MA is 
extended until the mm ends (the M—A shown in the dotted line box in figure 7.18) to create a 
single slot. When two or more instructions not related to the multiplier come between the 
MAC.W and MULS.W instructions, MAC.W and MULS.W contention does not cause 
stalling. When the MULS.W MA and IF contend, the slot is split. 


<> <> <> <>: Slot 


MULS.W 
Other instruction 


<> <> <> <>: Slot 


Other instruction 


MULS.W 
Other instruction 


> <> <> <>: Slot 


Other instruction 

Other instruction — 
MULS.W 

Other instruction 





Figure 7.18 MULS.W Instruction Immediately After a MAC.W Instruction 
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3. When an STS (register) instruction is located immediately after a MAC.W instruction 


When the contents of a MAC register are stored in a general-purpose register using an STS 
instruction, an MA stage for.accessing the multiplier is added to the STS instruction, as 
described later. When the MA of the STS instruction contends with the operating multiplier 
(mm), the MA is extended until the mm ends (the M—A shown in the dotted line box in 
figure 7.19) to create a single slot. The MA of the STS contends with the IF. Figure 7.19 
illustrates how this occurs, assuming MA and IF contention. 


| + + +> + + A —___ iv eee Slot 
MAC.W _IF 


Other instruction 
Other instruction 
Other instruction 


MAC.W _ if 


Other instruction 
Other instruction 
Other instruction 





Figure 7.19 STS (Register) Instruction Immediately After a MAC.W Instruction | 
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4. When an STS.L (memory) instruction is located immediately after a MAC.W instruction 


When the contents of a MAC register are stored in memory using an STS instruction, an MA 
stage for accessing the multiplier and writing to memory is added to the STS instruction, as 
described later. When the MA of the STS instruction contends with the operating multiplier 
(mm), the MA is extended until one state after the mm ends (the M—A shown in the dotted 
line box in figure 7.20) to create a single slot. The MA of the STS contends with the IF. 
Figure 7.20 illustrates how this occurs, assuming MA and IF contention. 


> + +> + + A PO RO OHM Slot 
MAC.W IF _ ID 
STS.L 
Other instruction EX MA 
Other instruction i ID EX 
Other instruction IF ID EX 


MAC.W _ if 
STS.L 
Other instruction 
Other instruction 
Other instruction 





Figure 7.20 STS.L (Memory) Instruction Immediately After a MAC.W Instruction 
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5. When an LDS (register) instruction is located immediately after a MAC.W instruction - 


When the contents of a MAC register are loaded from a general-purpose register using an 
LDS instruction, an MA stage for accessing the multiplier is added to the LDS instruction, as 
described later. When the MA of the LDS instruction contends with the operating multiplier 
(mm), the MA is extended until the mm ends (the M—A shown in the dotted line box in 
figure 7.21) to create a single slot. The MA of this LDS contends with IF. Figure 7.21 
illustrates how this occurs, assuming MA and IF contention. 


<> <> +> «+> +> + _____ Pr? 8 oo <>: Slot 
MAC.W IF a | 
LDS 
Other instruction 





Other instruction | 
Other instruction IF ID EX -- 


<> +> <> <> «> +?  @ @ @& @ @: Slot 
7 ID EX MA MA mun Ga a 


ee ee ey 


Ce ey 


Other instruction 
Other instruction 
Other instruction if — ID EX ... 





Figure 7.21 LDS (Register) Instruction Immediately After a MAC.W Instruction 
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6. When anLDS.L (memory) instruction is located immediately after a MAC.W instruction 


When the contents of a MAC register are loaded from memory using an LDS instruction, an 
MA stage for accessing the memory and the multiplier is added to the LDS instruction, as 
described later. When the MA of the LDS instruction contends with the operating multiplier 
(mm), the MA is extended until the mm ends (the M—A shown in the dotted line box in 
figure 7.22) to create a single slot. The MA of the LDS contends with IF. Figure 7.22 
illustrates how this occurs, assuming MA and IF contention. 


<> +> <> +> +> WH _ $1920) oo: Slot 
MAC.W IF 
LDS.L 
Other instruction 
Other instruction 
Other instruction IF ID EX 


+> +> +> +> +> +> DH DH YH HY HD @: Slot 
if 


Other instruction 
Other instruction 
Other instruction 





Figure 7.22 LDS.L (Memory) Instruction Immediately After a MAC.W Instruction 
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Multiply/Accumulate Instruction (SH-2 CPU): Includes the following instruction type: 


¢ MAC.W @Rm+, @Rn+ 


> > oe? oe Oo OO @}] @: Slot 
MAC.W IF ID EX MA MA mm mm 7 


Next instruction IF — ID EX MA WB 
Third instruction IF ID EX MA WB 





Figure 7.23 Multiply/Accumulate Instruction Pipeline 


Operation: The pipeline has seven stages: IF, ID, EX, MA, MA, mm and mm (figure 7.23). The 
second MA reads the memory and accesses the multiplier. The mm indicates that the multiplier is 
operating. The mm operates for two cycles after the final MA ends, regardless of slot. The ID of 
the instruction after the MAC.W instruction is stalled for one slot. The two MAs of the MAC.W 
instruction, when they contend with IF, split the slots as described in Section 7.4, Contention 
Between Instruction Fetch (IF) and Memory Access (MA). 


When an instruction that does not use the multiplier follows the MAC.W instruction, the MAC.W 
instruction may be considered to be a five-stage pipeline instructions of IF, ID, EX, MA, and MA. 
In such cases, the ID of the next instruction simply stalls one slot and thereafter the pipeline 
operates normally. When an instruction that uses the multiplier comes after the MAC.W 
instruction, contention occurs with the multiplier, so operation is not as normal. This occurs in the 
following cases: 


When a MAC.W instruction is located immediately after another MAC.W instruction 
When a MAC.L instruction is located immediately after a MAC.W instruction 

When a MULS.W instruction is located immediately after a MAC.W instruction 

When a DMULS.L instruction is located immediately after a MAC.W instruction 

When an STS (register) instruction is located immediately after a MAC.W instruction 
When an STS.L (memory) instruction is located immediately after a MAC.W instruction 
When an LDS (register) instruction is located immediately after a MAC.W instruction 


COS CN ee ae Ss 


When an LDS.L (memory) instruction is located immediately after a MAC.W instruction 
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1. When a MAC.W instruction is located immediately after another MAC.W instruction 


The second MA of a MAC.W instruction does not contend with an mm generated by a 
preceding multiplication instruction. 


+> +> <+> +> @: Slo 





Figure 7.24 MAC.W Instruction That Immediately Follows Another MAC.W instruction 


Sometimes consecutive MAC.Ws may have misalignment of instruction execution caused by 
MA and IF contention. Figure 7.25 illustrates a case of this type. This figure assumes MA and 
IF contention. 


+> +> + D> + 4 OO YH YH YH OY @]: Slot 


MAC.W if ID EX MA MA mmmm 
MAC.W IF — ID EX MA — MA mmmm 


MAC.W if — — ID EX MA MA mmmm 
IF — ID EX MA MA mm =.--.- 





Figure 7.25 Consecutive MAC.Ws with Misalignment 


177 


When the second MA of the MAC.W instruction contends with IF; the slot will split as usual. 
Figure 7.26 illustrates a case of this type. This figure assumes MA and IF contention. 


<> <> <> +? <> +> +> aw +> @&> +> 2: Slot 


Other instruction 
Other instruction 
Other instruction 





Figure 7.26 MA and IF Contention 
2. When aMACL instruction is located immediately after a MAC.W instruction 


| The second MA of a MAC.W instruction does not contend with an mm generated bya 
preceding multiplication instruction (figure 7.27). 


> +> <+> <+> <=: Slot 





Figure 7.27 MAC.L Instructions Immediately After a MAC.W Instruction 
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3. When a MULS.W instruction is located immediately after a MAC.W instruction 


MULS.W instructions have an MA stage for accessing the multiplier. When the MA of the 
MULS.W instruction contends with an operating MAC.W instruction multiplier (mm), the 
MA is extended until the mm ends (the M—A shown in the.dotted line box in figure 7.28) to 
create a single slot. When one or more instructions not related to the multiplier come between 
the MAC.W and MULS.W instructions, MAC.W and MULS.W contention does not cause 
stalling. There is no MULS.W MA contention while the MAC.W instruction multiplier is 
operating (mm). When the MULS.W MA and IF contend, the slot is split. 


> +> +> +> +> 4 Hr wD +H +H HD +H =H: Silo 
MAC.W IF 
MULS.W IF 
Other instruction 


MAC.W _IF 
Other instruction — ID EX | 
MULS.W IF ID EX :MA:mm mm 
Other instruction IF ID EX MA 





Figure 7.28 MULS.W Instruction Immediately After a MAC.W Instruction 
4. When a DMULS.L instruction is located immediately after a MAC.W instruction 


DMULS.L instructions have an MA stage for accessing the multiplier, but there is no 
DMULS.L MA contention while the MAC.W instruction multiplier is operating (mm). When 
the DMULS.L MA and IF contend, the slot is split (figure 7.29). 


<> +> <> +> +> +> OH HD SHOP OD SH OD SH > |: «Slot 


IF ~— ID EX MA 





Figure 7.29 DMULS.L Instructions Immediately After a MAC.W Instruction 
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5. When an STS (register) instruction is located immediately after a MAC.W instruction 


When the contents of a MAC register are stored in a general-purpose register using an STS 
instruction, an MA stage for accessing the multiplier is added to the STS instruction, as 
described later. When the MA of the STS instruction contends with the operating multiplier 
(mm), the MA is extended until the mm ends (the M—A shown in the dotted line box in 
figure 7.30) to create a single slot. The MA of the STS contends with the IF. Figure 7.30 
illustrates how this occurs, assuming MA and IF contention. 


+> +> +> + +> —~—+_ or oD? DD D> @D: Slot 
MAC.W IF 
STS 
Other instruction IF ID EX MA 
Other instruction ID EX 
Other instruction IF ID EX 


STS 
Other instruction 
Other instruction 
Other instruction 





Figure 7.30 STS (Register) Instruction Immediately After a MAC.W Instruction 
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6. When an STS.L (memory) instruction is located immediately after a MAC.W instruction 


When the contents of a MAC register are stored in memory using an STS instruction, an MA 
stage for accessing the memory and the multiplier and writing to memory is added to the STS 
instruction, as described later. Figure 7.31 illustrates how this occurs, assuming MA and IF 
contention. | 


> <> <> +> + + oH oO OD OD? @D @' Slot 
MAC.W IF 
STS.L 
Other instruction 
Other instruction 
Other instruction IF ID EX 


STS.L 
Other instruction 
Other instruction ID EX 
Other instruction if ID EX 





Figure 7.31 STS.L (Memory) Instruction Immediately After a MAC.W Instruction 


181 


7. When an LDS (register) instruction is located immediately after a MAC.W instruction 


When the contents of a MAC register are loaded from a general-purpose register using an 
LDS instruction, an MA stage for accessing the multiplier is added to the LDS instruction, as 

_ described later. When the MA of the LDS instruction contends with the operating multiplier 
(mm), the MA is extended until the mm ends (the M—A shown in the dotted line box in 
figure 7.32) to create a single slot. The MA of this LDS contends with IF. eM 7.32 
illustrates how this occurs, assuming MA and IF contention. 


+> +> «+> +> o> + vo oO DD &: Slot 
MAC.W IF 


LDS 
Other instruction 
Other instruction 
Other instruction IF ID EX 


MAC.W if 


Other instruction | 
Other instruction ID EX 
Other instruction | | if ID EX 





Figure 7.32 LDS (Register) Instruction Immediately After a MAC.W Instruction 
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When an LDS.L (memory) instruction is located immediately after a MAC.W instruction 


When the contents of a MAC register are loaded from memory using an LDS instruction, an 
MA stage for accessing the multiplier is added to the LDS instruction, as described later. 
When the MA of the LDS instruction contends with the operating multiplier (mm), the MA is 
extended until the mm ends (the M—A shown in the dotted line box in figure 7.33) to create a 
single slot. The MA of the LDS contends with IF. Figure 7.33 illustrates how this occurs, 
assuming MA and IF contention. 


> +> +> + +> +t —_ om @H DD @D @: Slot 


Other instruction 
Other instruction 
Other instruction IF ID EX 


MAC.W if 
LDS.L IF 
Other instruction 
Other instruction 
Other instruction 





Figure 7.33  LDS.L (Memory) Instruction Immediately After a MAC.W Instruction 
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Double-Length Multiply/Accumulate Instruction (SH-2 CPU): Includes the following 
instruction type: 


¢ MAC.L @Rm+, @Rn+ (SH-2 CPU only) 


> oe HY oO oO OO H] @- : Slot 
MAC.L IF ID EX MA MA mm mm mm mm 


Next instruction IF — ID EX MA WB 
Third instruction IF ID EX MA WB 





| Figure 7.34 Multiply/Accumulate Instruction Pipeline 


Operation: The pipeline has nine stages: IF, ID, EX, MA, MA, mm, mm, mm, and mm (figure 
7.34). The second MA reads the memory and accesses the multiplier. The mm indicates that the 
multiplier is operating. The mm operates for four cycles after the final MA ends, regardless of a 
slot. The ID of the instruction after the MAC.L instruction is stalled for one slot. The two MAs of | 
the MAC.L instruction, when they contend with IF, split the slots as described in Section 7.4, 
Contention Between Instruction Fetch (IF) and Memory Access (MA). 


When an instruction that does not use the multiplier follows the MAC.L instruction, the MAC.L 
instruction may be considered to be five-stage pipeline instructions of IF, ID, EX, MA, and MA. 
In such cases, the ID of the next instruction simply stalls one slot and thereafter the pipeline 
operates normally. When an instruction that uses the multiplier comes after the MAC.L 
instruction, contention occurs with the multiplier, so operation is not as normal. This occurs in the 
following cases: 


When a MAC.L instruction is located immediately after another MAC.L instruction 
When a MAC.W instruction is located immediately after a MAC.L instruction 

When a DMULS.L instruction is located immediately after a MAC.L instruction 

When a MULS.W instruction is located immediately after a MAC.L instruction 

When an STS (register) instruction is located immediately after a MAC.L instruction 
When an STS.L (memory) instruction is located immediately after a MAC.L instruction 
When an LDS (register) instruction is located immediately after a MAC.L instruction 


Oo OS ee ee 


When an LDS.L (memory) instruction is located immediately after a MAC.L instruction 
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When a MAC.L instruction is located immediately after another MAC.L instruction 


When the second MA of the MAC.L instruction contends with the mm produced by the 
previous multiplication instruction, the MA bus cycle is extended until the mm ends (the M— 
A shown in the dotted line box in figure 7.35) to create a single slot. When two or more 
instructions that do not use the multiplier occur between two MAC.L instructions, the stall 
caused by multiplier contention between MAC.L instructions is eliminated. 


+> + + > @& @: Slot 


MAC.L IF 
MAC.L IF 
Third instruction 


MAC.L IF ID EX MA MA 
Other instruction IF — ID EX MA WB 
Other instruction IF ID EX MA WB 
IF 





Figure 7.35 MAC.L Instruction Immediately After Another MAC.L Instruction 


Sometimes consecutive MAC.Ls may have less multiplier contention even when there is 
misalignment of instruction execution caused by MA and IF contention. Figure 7.36 
illustrates a case of this type, assuming MA and IF contention. 


> +> +? o> + qa «+ | Ww —_ > @ <>: Slot 
if ID EX MA MA mm mm: 


A mm mm mm mm 
ID EX — — MA 





Figure 7.36 Consecutive MAC.Ls with Misalignment 
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When the second MA of the MAC.L instruction is extended to the end of the mm, contention 
between the MA and IF will split the slot in the usual way. Fi gure 7.37 illustrates a case e of 
this type, pstinine MA and IF contention. 


<> <> «> «> oo + __ > vw oo: Slot 


Other intruction | 
Other intruction | i , ID 
Other intruction _ 3 | : IF 





Figure 7.37 MA and IF Contention 
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2. When a MAC.W instruction is located immediately after a MAC.L instruction 


When the second MA of the MAC.W instruction contends with the mm produced by the 
previous multiplication instruction, the MA bus cycle is extended until the mm ends (the M— 
A shown in the dotted line box in figure 7.38) to create a single slot. When two or more 
instructions that do not use the multiplier occur between the MAC.L and MAC.W 
instructions, the stall caused by multiplier contention between MAC.L instructions is 
eliminated. 


+> +> +> + o> : Slot 
MAC.L IF !ID EX MA 
MAC.W 
Third instruction 


MAC.L IF ID EX MA 
Other instruction EX MA WB 
Other instruction ID EX MA WB 
IF 





Figure 7.38 MAC.W Instruction Immediately After a MAC.L Instruction 
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3. When a DMULS.L instruction is located immediately after a MAC.L instruction 


DMULS.L instructions have an MA stage for accessing the multiplier. When the second MA 
of the DMULS.L instruction contends with an operating MAC.L instruction multiplier (mm), 
the MA is extended until the mm ends (the M—A shown in the dotted line box in figure 7.39) 
to create a single slot. When two or more instructions not related to the multiplier come 

between the MAC.L and DMULS.L instructions, MAC.L and DMULS.L contention does not 
cause stalling. When the DMULS.L MA and IF contend, the slot is split. 


MAC.L IF 
DMULS.L 
Other instruction 


MAC.L IF 
Other instruction IF 
DMULS.L 
Other instruction 


<> <> <> : Slot 


MAC.L IF 
Other instruction IF 
Other instruction 
DMULS.L 
Other instruction 





Figure 7.39 DMULS.L Instruction Immediately After a MAC.L Instruction 
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When a MULS.W instruction is located immediately after a MAC.L instruction 


MULS.W instructions have an MA stage for accessing the multiplier. When the MA of the 
MULS.W instruction contends with an operating MAC.L instruction multiplier (mm), the MA 
is extended until the mm ends (the M—A shown in the dotted line box in figure 7.40) to 
create a single slot. When three or more instructions not related to the multiplier come 
between the MAC.L and MULS.W instructions, MAC.L and MULS.W contention does not 
cause stalling. When the MULS.W MA and IF contend, the slot is split. 
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Other instruction IF 
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MAC.L IF mm.mm: 
Other instruction IF 
Other instruction 
MULS.W = IF ID EX :M——A: 
Otherinstruction = WF ID EX — MA = 


Pr ee ey 


MAC.L IF ID EX MA MA ‘mm: 
Other instruction IF — 
Other instruction 
Other instruction 
MULS.W ERO eee 
Other instruction ©  |F ID EX MA. es: 





Figure 7.40 MULS.W Instruction Immediately After a MAC.L Instruction 
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5. When an STS (register) instruction is located immediately after a MAC.L instruction 


When the contents of a MAC register are stored in a general-purpose register using an STS 
instruction, an MA stage for accessing the multiplier is added to the STS instruction, as 
described later. When the MA of the STS instruction contends with the operating multiplier 
(mm), the MA is extended until the mm ends (the M—A shown in the dotted line box in 
figure 7.41) to create a single slot. The MA of the STS contends with the IF. Figure 7.41 
illustrates how this occurs, assuming MA and IF contention. 


<> +> <> + > He mR 0: Slot 
MAC.L IF ID 


Other instruction 


Other instruction 
Other instruction 


MAC.L _ if 


Other instruction 
Other instruction 


Other instruction 





Figure 7.41 STS (Register) Instruction Immediately After a MAC.L Instruction 
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6. When an STS.L (memory) instruction is located immediately after a MAC.L instruction 


When the contents of a MAC register are stored in memory using an STS instruction, an MA 
stage for accessing the multiplier and writing to memory is added to the STS instruction, as 
described later. The MA of the STS contends with the IF. Figure 7.42 illustrates how this 
occurs, assuming MA and IF contention. 


+> <> <> +> > 2 Oo #18 He Slot 
MAC.L IF 
STS.L 
Other instruction 
Other instruction 
Other instruction 


MAC.L if 

STS.L 
Other instruction 
Other instruction 
Other instruction 





Figure 7.42 STS.L (Memory) Instruction Immediately After a MAC.L Instruction 
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7. When an LDS (register) instruction is located immediately after a MAC.L instruction 


When the contents of a MAC register are loaded from a general-purpose register using an 
LDS instruction, an MA stage for accessing the multiplier is added to the LDS instruction, as 
described later. When the MA of the LDS instruction contends with the operating multiplier 
(mm), the MA is extended until the mm ends (the M—A shown in the dotted line box in 
figure 7.43) to create a single slot. The MA of this LDS contends with IF. Figure 7.43 
illustrates how this occurs, assuming MA and IF contention. 


> +> <> +> +> + Sar 49049 oe: Slot 
MAC.L IF 
LDS 
Other instruction 
Other instruction 
Other instruction IF ID EX 


+> +> <> +> +> +w —__ ro oo > @ @-: Slot 
MAC.L _ if 
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Other instruction 
Other instruction 
Other instruction 





Figure 7.43 LDS (Register) Instruction Immediately After a MAC.L Instruction 
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8. When an LDS.L (memory) instruction is located immediately after a MAC.L instruction 


When the contents of a MAC register are loaded from memory using an LDS instruction, an 
MA stage for accessing the memory and the memory and the multiplier is added to the LDS 
instruction, as.described later. When the MA of the LDS instruction contends with the 
operating multiplier (mm), the MA is extended until the mm ends (the M—A shown in the 
dotted line box in figure 7.44) to create a single slot. The MA of the LDS contends with IF. 
Figure 7.44 illustrates how this occurs, assuming MA and IF contention. 


+> +> +> + +> —- A mH HD: Slot 
MAC.L IF ID 
LDS.L 
Other instruction 
Other instruction 
Other instruction IF ID EX 


LDS.L 
Other instruction 
Other instruction 
Other instruction 





Figure 7.44 LDS.L (Memory) Instruction Immediately After a MAC.L Instruction 
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Multiplication Instructions (SH-1 CPU): Include the following instruction types: 


¢ MULS.W Rm, Rn 
¢ MULU.W Rm, Rn 


+> +> + + o> > @&] @: Slot 
MULS.W IF ID EX MA mm mm mm 


Next instruction IF ID EX MA WB 
Third instruction IF ID EX MA WB 





Figure 7.45 Multiplication Instruction Pipeline 


Operation: The pipeline has seven stages: IF, ID, EX, MA, mm, mm, and mm (figure 8.45). The 

MA accesses the multiplier. The mm indicates that the multiplier is operating. The mm operates 
for three cycles after the MA ends, regardless of a slot. The MA of the MULS.W instruction, when 
it contends with IF, splits the slot as described in Section 7.4, Contention Between Instruction 
Fetch (IF) and Memory Access (MA). 


When an instruction that does not use the multiplier comes after the MULS.W instruction, the 
MULS.W instruction may be considered to be four-stage pipeline instructions of IF, ID, EX, and 
MA. In such cases, it operates like a normal pipeline. When an instruction that uses the multiplier 
comes after the MULS.W instruction, however, contention occurs with the multiplier, so operation 
is not as normal. This occurs in the following cases: 


When a MAC.W instruction is located immediately after a MULS.W instruction 

When a MULS.wW instruction is located immediately after another MULS.W instruction 
When an STS (register) instruction is located immediately after a MULS.W instruction 
When an STS.L (memory) instruction is located immediately after a MULS.W instruction 
When an LDS (register) instruction is located immediately after a MULS.W instruction 
When an LDS.L (memory) instruction is located immediately after a MULS.W instruction 


OY tee oe 
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1. When a MAC.W instruction is located immediately after a MULS.W instruction 


When the second MA of a MAC.W instruction contends with the mm generated by a 
preceding multiplication instruction, the bus cycle of that MA is extended until the mm ends 
(the M—A shown in the dotted line box below) and that extended MA occupies one slot. 


If one or more instructions not related to the multiplier comes between the MULS.W and 
MAC.W instructions, multiplier contention between the MULS.W and MAC.W instructions 
does not cause stalls (figure 7.46). 


>> + +> + + oH HD DH HD @-: Slot 
MULS.W IF 
MAC.W 
Third instruction 


MULS.W IF 
Other instruction IF ID EX MA WB 
IF ID EX MA:MA: mm mm mm 





Figure 7.46 MAC.W Instruction Immediately After a MULS.W Instruction 
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When a MULS.W instruction is located immediately after another MULS.W instruction 


MULS.W instructions have an MA stage for accessing the multiplier. When the MA of the 
MULS.W instruction contends with the operating multiplier (mm) of another MULS.W 
instruction, the MA is extended until the mm ends (the M—A shown in the dotted line box in 
figure 7.47) to create a single slot. When two or more instructions not related to the multiplier 
are located between the two MULS.W instructions, contention between the MULS.Ws does 
not cause stalling. When the MULS.W MA and IF contend, the slot is split. 


> +> +> +> + vo vo oD oD DD DD @: Slot 
MULS.W IF 
MULS.W 
Other instruction 


MULS.W IF 
Other instruction 

MULS.W 
Other instruction 


MULS.W IF 
Other instruction IF ID EX MA WB 
Other instruction IF ID EX MA WB 
MULS.W IF 
Other instruction 





Figure 7.47 MULS.W Instruction Immediately After Another MULS.W Instruction 
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When the MA of the MULS.W instruction is extended until the mm ends, contention between 
MA and IF will split the slot, as is normal. Figure 7.48 illustrates a case of this type, assuming 
MA and IF contention. © 


+> +> + + + _Y vvvvve: Slot 


Other instruction 
~ Other instruction 
Other instruction 





Figure 7.48 MULS.W Instruction Immediately After Another MULS.W Instruction (IF and 
MA Contention) 
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3. When an STS (register) instruction is located immediately after a MULS.W instruction 


When the contents of a MAC register are stored in a general-purpose register using an STS 
instruction, an MA stage for accessing the multiplier is added to the STS instruction, as 
described later. When the MA of the STS instruction contends with the operating multiplier 
(mm), the MA is extended until the mm ends (the M—A shown in the dotted line box in 
figure 7.49) to create a single slot. The MA of the STS contends with the IF. Figure 7.49 
illustrates how this occurs, assuming MA and IF contention. 


+> + <> + w__—___-> vo wo @ DH DD @: Slot 
MULS.W IF 
STS 
Other instruction 
Other instruction 
Other instruction 


> <> +> <> <> <>: Slot 


if 
STS 
Other instruction 
Other instruction 
Other instruction 





Figure 7.49 STS (Register) Instruction Immediately After a MULS.W Instruction 
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4. When an STS.L (memory) instruction is located immediately after a MULS.W instruction 


When the contents of a MAC register are loaded from memory using an STS instruction, an 
MA stage for accessing the multiplier and writing to memory is added to the STS instruction, 
as described later. When the MA of the STS instruction contends with the operating multiplier 
(mm), the MA is extended until one cycle after the mm ends (the M—A shown in the dotted 
line box in figure 7.50) to create a single slot. The MA of the STS contends with the IF. 
Figure 7.50 illustrates how this occurs, assuming MA and IF contention. 


+> <> +> <> wo vo o> o> OD @& @: Slot 
MULS.W IF 
STS.L 
Other instruction 
Other instruction 
Other instruction 


STS.L 
Other instruction 
Other instruction 
Other instruction 





Figure 7.50 STS.L (Memory) Instruction Immediately After a MULS.W Instruction 
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5. When an LDS (register) instruction is located immediately after a MULS.W instruction 


When the contents of a MAC register are loaded from a general-purpose register using an 
LDS instruction, an MA stage for accessing the multiplier is added to the LDS instruction, as 
described later. When the MA of the LDS instruction contends with the operating multiplier 
(mm), the MA is extended until the mm ends (the M—A shown in the dotted line box below) 
to create a single slot. The MA of this LDS contends with IF. Figure 7.51 illustrates how this 
occurs, assuming MA and IF contention. 


> + + +> + —_? oH oo @ wo @ <>: Slot 
MULS.W IF 
LDS 
Other instruction 
Other instruction 
Otherinstruction TF ID EX + 


eee e twee e saree e eer oreo ee 


Beer eet eerreesrenresseves 


Pa 


ee 


LDS IF 
Other instruction 
Other instruction 
Other instruction if — ID EX ev 


ee ey 





Figure 7.51 LDS (Register) Instruction Immediately After a MULS.W Instruction 
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6. When an LDS.L (memory) instruction is located immediately after a MULS.W instruction 


When the contents of a MAC register are loaded from memory using an LDS instruction, an 
MA stage for accessing the memory and the multiplier is added to the LDS instruction, as 
described later. When the MA of the LDS instruction contends with the operating multiplier 
(mm), the MA is extended until the mm ends (the M—A shown in the dotted line box in 
figure 7.52) to create a single slot. The MA of the LDS contends with IF. Figure 7.52 
illustrates how this occurs, assuming MA and IF contention. 


<> <> +> +> +___ om > @ > @-: Slot 
MULS.W IF 
LDS.L 
Other instruction 
Other instruction 
Other instruction 


LDS.L 
Other instruction 
Other instruction 
Other instruction 





Figure 7.52 LDS.L (Memory) Instruction Immediately After a MULS.W Instruction 
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Multiplication Instructions (SH-2 CPU): Include the following instruction types: 


¢ MULS.W Rm, Rn 
¢ MULU.W Rm, Rn 


+> + + +} @] | &Y @ :: Slot 
MULS.W IF ID EX MA mm mm 


Next instruction IF ID EX MA WB 
Third instruction IF ID EX MA WB 





Figure 7.53 Multiplication Instruction Pipeline 


Operation: The pipeline has six stages: IF, ID, EX, MA, mm, and mm (figure 8.53). The MA 
accesses the multiplier. The mm indicates that the multiplier is operating. The mm operates for 
two cycles after the MA ends, regardless of the slot. The MA of the MULS.W instruction, when it 
contends with IF, splits the slot as described in Section 7.4, Contention Between Instruction Fetch 
(IF) and Memory Access (MA). 


When an instruction that does not use the multiplier comes after the MULS.W instruction, the 
MULS.W instruction may be considered to be four-stage pipeline instructions of IF, ID, EX, and 
MA. In such cases, it operates like a normal pipeline. When an instruction that uses the multiplier 
is located after the MULS.W instruction, however, contention occurs with the multiplier, so 
Operation is not as normal. This occurs in the following cases: 


When a MAC.W instruction is located immediately after a MULS.W instruction 

When a MAC.L instruction is located immediately after a MULS.W instruction 

When a MULS.W instruction is located immediately after another MULS.W instruction 
When a DMULS.L instruction is located immediately after a MULS.W instruction 

When an STS (register) instruction is located immediately after a MULS.W instruction 
When an STS.L (memory) instruction is located immediately after a MULS.W instruction 
When an LDS (register) instruction is located immediately after a MULS.W instruction 


ea ed oe a a 


When an LDS.L (memory) instruction is located immediately after a MULS.W instruction 
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1. When a MAC.W instruction is located immediately after a MULS.W instruction 


The second MA of a MAC.W instruction does not contend with the mm generated by a 
preceding multiplication instruction. 


>? or oD Or OD OH HD OD DH: Slot 
MULS.W IF ID EX MA mm mm 


MAC.W IF ID EX MA MA mm mm 
Third instruction IF — ID EX MA 





Figure 7.54 MAC.W Instruction Immediately After a MULS.W Instruction 
2. When a MACLL instruction is located immediately after a MULS.W instruction 


The second MA of a MAC.W instruction does not contend with the mm generated by a 
preceding multiplication instruction. 


+> +> + +> +> +] |] LY HD |] J]: Siot 


MULS.W IF ID EX MA mm mm 


MAC.L IF ID EX MA MA mm mm mm mm 
Third instruction — IF — ID EX MA ---- 





Figure 7.55 MAC.L Instruction Immediately After a MULS.W Instruction 
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3. When a MULS.W instruction is located immediately after another MULS.W instruction 


MULS.W instructions have an MA stage for accessing the multiplier. When the MA of the 
MULS.W instruction contends with the operating multiplier (mm) of another MULS.W 
instruction, the MA is extended until the mm ends (the M—A shown in the dotted line box in 
figure 7.56) to create a single slot. When one or more instructions not related to the multiplier 
is located between the two MULS.W instructions, contention between the MULS.Ws does not 
cause stalling. When the MULS.W MA and IF contend, the slot is split. 


> <> + + > + @: Slot 


MULS.W IF 
MULS.W 
Other instruction IF ID EX 


> +> + + + + @: Slot 
MULS.W IF 
Other instruction 
MULS.W 
Other instruction IF ID EX 





Figure 7.56 MULS.W Instruction Immediately After Another MULS.W Instruction 


When the MA of the MULS.W instruction is extended until the mm ends, contention between 
the MA and IF will split the slot in the usual way. Figure 7.57 illustrates a case of this type, 
assuming MA and IF contention. 


+> +> +> +> qa _—_ vt > D> > DH @-: Slot 


Other instruction IF 
Other instruction 


Other instruction 





Figure 7.57 MULS.W Instruction Immediately After Another MULS.W Instruction (IF and 
MA contention) 
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4. When a DMULS.L instruction is located immediately after a MULS.W instruction 


Though the second MA in the DMULS.L instruction makes an access to the multiplier, it does 
not contend with the operating multiplier (mm) generated by the MULS.W instruction. — 


+> +) + Yo | HY OO DY OO OH DY @-: Slo 


MULS.W IF ID EX MA mm mm | 


DMULS.L IF ID EX MA MA mm mm mm mm 
Other instruction 





Figure 7.58 DMULS.L Instruction Immediately After a MULS.W Instruction | 
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5. When an STS (register) instruction is located immediately after a MULS.W instruction 


When the contents of a MAC register are stored in a general-purpose register using an STS 
instruction, an MA stage for accessing the multiplier is added to the STS instruction, as 
described later. When the MA of the STS instruction contends with the operating multiplier 
(mm), the MA is extended until the mm ends (the M—A shown in the dotted line box in 
figure 7.59) to create a single slot. The MA of the STS contends with the IF. Figure 7.59 
illustrates how this occurs, assuming MA and IF contention. 


<q +> <> <> +> <=> @: Slot 


STS 
Other instruction 
Other instruction 
Other instruction IF ID EX 


+> +> + + + @D @:: Slot 
if 
STS 
Other instruction 
Other instruction 
Other instruction 





Figure 7.59 STS (Register) Instruction Immediately After a MULS.W Instruction 


207 


6. When an STS.L (memory) instruction is located immediately after a MULS.W instruction 


When the contents of a MAC register are stored in memory using an STS instruction, an MA 
stage for accessing the multiplier and writing to memory is added to the STS instruction, as 
described later. The MA of the STS contends with the IF. Figure 7.60 illustrates how this 
occurs, assuming MA and IF contention. 


+ > +> +> + oH oo @ DD @D @:: Slot 
MULS.W IF 
STS.L 
Other instruction 
Other instruction 
Other instruction IF ID EX 


+> +> +> ++ oo DD OD OD OD DY @: Slot 
if 
STS.L IF 
Other instruction 
Other instruction 
Other instruction 





Figure 7.60 STS.L (Memory) Instruction Immediately After a MULS.W Instruction 
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7. When an LDS (register) instruction is located immediately after a MULS.W instruction 


When the contents of a MAC register are loaded from a general-purpose register using an 
LDS instruction, an MA stage for accessing the multiplier is added to the LDS instruction, as 
described later. When the MA of the LDS instruction contends with the operating multiplier 
(mm), the MA is extended until the mm ends (the M—A shown in the dotted line box below) 
to create a single slot. The MA of this LDS contends with IF. The following figures illustrates 
how this occurs, assuming MA and IF contention. 


+> +> +> +> <> <=> @-: Slot . 


LDS 
Other instruction 
Other instruction 
Other instruction IF ID EX 


> + oe > @ @:: Slot 
if 
LDS 
Other instruction 
Other instruction 
Other instruction 





Figure 7.61 LDS (Register) Instruction Immediately After a MULS.W Instruction 
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8. When an LDS.L (memory) instruction is located immediately after a MULS.W instruction 


When the contents of a MAC register are loaded from memory using an LDS instruction, an 
MA stage for accessing the multiplier is added to the LDS instruction, as described later. 
When the MA of the LDS instruction contends with the operating multiplier (mm), the MA is 

~- extended until the mm ends (the M—A shown in the dotted line box in figure 7.62) to create a 
single slot. The MA of the LDS contends with IF. Figure 7.62 illustrates how this occurs, 
assuming MA and IF contention. 


ee ee eee wee 
IF 
LDS.L 
Other instruction 
Other instruction 
Other instruction : IF ID EX 


<> <> <> <> <> <> >: Siot 


if 
LDS.L 
Other instruction 
Other instruction 
Other instruction 





Figure 7.62 LDS.L (Memory) Instruction Immediately After a MULS.W Instruction 
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Double-Length Multiplication Instructions (SH-2 CPU): Include the following instruction 
types: 


¢ DMULS.L Rm, Rn (SH-2 CPU only) 
¢ DMULU.L ~~ Rm, Rn (SH-2 CPU only) 
¢ MUL.L Rm, Rn (SH-2 CPU only) 


> +? + +} HY YH YH LY OO |] DP: Slot 
DMULS.L IF ID EX MA MA mm mm mm mm 


Next instruction IF ~— ID EX MA WB 
Third instruction IF ID EX MA WB 





Figure 7.63 Multiplication Instruction Pipeline 


The pipeline has nine stages: IF, ID, EX, MA, MA, mm, mm, mm, and mm (figure 7.63). The MA 
accesses the multiplier. The mm indicates that the multiplier is operating. The mm operates for 
four cycles after the MA ends, regardless of a slot. The ID of the instruction following the 
DMULS.L instruction is stalled for 1 slot (see the description of the multiply/accumulate 
instruction). The two MA stages of the DMULS.L instruction, when they contend with IF, split the 
slot as described in section 7.4, Contention Between Instruction Fetch (IF) and Memory Access 
(MA). 


When an instruction that does not use the multiplier comes after the DMULS.L instruction, the 
DMULS.L instruction may be considered to be a five-stage pipeline instruction of IF, ID, EX, 
MA, and MA. In such cases, it operates like a normal pipeline. When an instruction that uses the 
multiplier comes after the DMULS.L instruction, however, contention occurs with the multiplier, 
SO Operation is not as normal. This occurs in the following cases: 


When a MAC.L instruction is located immediately after a DMULS.L instruction 

When a MAC.W instruction is located immediately after a DMULS.L instruction 

When a DMULS.L instruction is located immediately after another DMULS.L instruction 
When a MULS.W instruction is located immediately after a DMULS.L instruction 

When an STS (register) instruction is located immediately after a DMULS.L instruction 
When an STS.L (memory) instruction is located immediately after a DMULS.L instruction 
When an LDS (register) instruction is located immediately after a DMULS.L instruction 


Se JOY, ee? ee 


When an LDS.L (memory) instruction is located immediately after a DMULS.L instruction 
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1. Whena MACLL instruction is located immediately after a DMULS.L instruction 


When the second MA of a MAC.L instruction contends with the mm generated by a 
preceding multiplication instruction, the bus cycle of that MA is extended until the mm ends 
(the M—A shown in the dotted line box below) and that extended MA occupies one slot. 


If two or more instructions not related to the multiplier are located between the DMULS.L 
and MAC.L instructions, multiplier contention between the DMULS.L and MAC.L 
instructions does not cause stalls (figure 7.64). | 


<+—_—_—_ > +> + +> 2: Slot 
DMULS.L IF 


MAC.L —_——A :mm mm mm mm 
Third instruction 


<+> +> <> <>: Slot 


DMULS.L IF ID EX MA MA mm 

Other instruction IF — ID EX MA 

Other instruction IF ID EX 
IF mm mm mm mm 





Figure 7.64 MAC.L Instruction Immediately After a DMULS.L Instruction 
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2. When a MAC.W instruction is located immediately after a DMULS.L instruction 


When the second MA of a MAC.W instruction contends with the mm generated by a 
preceding multiplication instruction, the bus cycle of that MA is extended until the mm ends 
(the M—A shown in the dotted line box below) and that extended MA occupies one slot. 


If two or more instructions not related to the multiplier are located between the DMULS.L 
and MAC.W instructions, multiplier contention between the DMULS.L and MAC.W 
instructions does not cause stalls (figure 7.65). 


<--> + > &> @: Slot 
DMULS.L IF 


MAC.W 
Third instruction 


<> <> <> <>: Siot 


DMULS.L IF ID EX MA MA mm 
Other instruction IF — ID EX MA 
Other instruction IF ID EX 


IF ID 





Figure 7.65 MAC.W Instruction Immediately After a DMULS.L Instruction 
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When a DMULS.L instruction is located immediately after another DMULS.L instruction 


DMULS.L instructions have an MA stage for accessing the multiplier. When the MA of the 
DMULS.L instruction contends with the operating multiplier (mm) of another DMULS.L 
instruction, the MA is extended until the mm ends (the M—A shown in the dotted line box in 
figure 7.66) to create a single slot. When two or more instructions not related to the multiplier 
are located between two DMULS.L instructions, contention between the DMULS.Ls does not 
cause stalling. When the DMULS.L MA and IF contend, the slot is split. 


DMULS.L 
DMULS.L 
Other instruction 


DMULS.L 
Other instruction 
DMULS.L 
Other instruction 


DMULS.L 
Other instruction 
Other instruction 
DMULS.L 
Other instruction 


IF 


IF 


IF 


+> +> <> +> <> @: Slot 


ID EX MA MA 


IF 


ID EX MA MA 
IF 





Figure 7.66 DMULS.L Instruction Immediately After Another DMULS.L Instruction 
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When the MA of the DMULS.L instruction is extended until the mm ends, contention 
between the MA and IF will split the slot in the usual way. Figure 7.67 illustrates a case of 
this type, assuming MA and IF contention. 


> > > +> +” + ___——__~P @ @ @ : Slot 


DMULS.L 


Other instruction 
Other instruction 
Other instruction 





Figure 7.67 DMULS.L Instruction Immediately After Another DMULS.L Instruction (IF 
and MA Contention) 
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4. When a MULS.W instruction is located immediately aftera DMULS.L instruction _ 


MULS.W instructions have an MA stage for accessing the multiplier. When the MA of the 
MULS.W instruction contends with the operating multiplier (mm) of a DMULS.L instruction, 
the MA is extended until the mm ends (the M—A shown in the dotted line box in figure 7.68) 
to create a single slot. When three or more instructions not related to the multiplier are located 
between the DMULS.L instruction and the MULS.W instruction, contention between the 
DMULS.L and MULS.W does not cause stalling. When the MULS.W MA and IF contend, | 
the slot is split. . 


> + or oo > wt —_—_ 7 @ om @ @ : Slot 
DMULS.L IF 
MULS.W 
Other instruction 


DMULS.L IF 
Other instruction 
Other instruction 
Other instruction 
MULS.W 
Other instruction 





Figure 7.68 MULS.W Instruction Immediately After a DMULS.L Instruction 


When the MA of the DMULS.L instruction is extended until the mm ends, contention 
between the MA and IF will split the slot in the usual way. Figure 7.69 illustrates a case of 
this type, assuming MA and IF contention. 


Beemer eer sere nr eer r errno nsvesenseone 


Otherinstruction  # £J|F Ip — — — — EX MA... 
Other instruction | a eee |B es > eer 
Other instruction I) | 30 | eer 





Figure 7.69 MULS.W Instruction Immediately After a DMULS.L Instruction (IF and MA 
| Contention) 
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5. When an STS (register) instruction is located immediately after a DMULS.L instruction 


When the contents of a MAC register are stored in a general-purpose register using an STS 
instruction, an MA stage for accessing the multiplier is added to the STS instruction, as 
described later. When the MA of the STS instruction contends with the operating multiplier 
(mm), the MA is extended until the mm ends (the M—A shown in the dotted line box in 
figure 7.70) to create a single slot. The MA of the STS contends with the IF. Figure 7.70 
illustrates how this occurs, assuming MA and IF contention. 


<> +> +> +> #_ tH PO 8 OOD: Slt 
DMULS.L IF 


Other instruction 
Other instruction 
Other instruction 


DMULS.L if 
STS 
Other instruction 
Other instruction 
Other instruction 





Figure 7.70 STS (Register) Instruction Immediately After a DMULS.L Instruction 
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6. When an STS.L (memory) instruction is located immediately after a DMULS.L instruction 


When the contents of a MAC register are stored in memory using an STS instruction, an MA 
stage for accessing the multiplier and writing to memory is added to the STS instruction, as 
described later. The MA of the STS contends with the IF. oe 7.71 illustrates how this 
occurs, assuming MA and IF contention. : 


STS.L 
Other instruction 


Other instruction 
Other instruction IF ID EX 


DMULS.L if 
STS.L 
Other instruction 
Other instruction 
Other instruction 





Figure 7.71 STS.L (Memory) Instruction Immediately After a DMULS.L Instruction 
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7. When an LDS (register) instruction is located immediately after a DMULS.L instruction 


When the contents of a MAC register are loaded from a general-purpose register using an 
LDS instruction, an MA stage for accessing the multiplier is added to the LDS instruction, as 
described later. When the MA of the LDS instruction contends with the operating multiplier 
(mm), the MA is extended until the mm ends (the M—A shown in the dotted line box below) 
to create a single slot. The MA of this LDS contends with IF. The following figure illustrates 
how this occurs, assuming MA and IF contention. 


<> +> +> +> +> ov vo o> & o @ : 6Slot 
DMULS.L IF 
LDS 
Other instruction 
Other instruction 
Other instruction IF ID EX 


+> + +> + > He __? oo @ @ @ : Slot 
DMULS.L if 
LDS 
Other instruction 
Other instruction 
Other instruction 





Figure 7.72 LDS (Register) Instruction Immediately After a DMULS.L Instruction 
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8. When an LDS.L (memory) instruction is located immediately after a DMULS.L instruction 


When the contents of a MAC register are loaded from memory using an LDS instruction, an 

_ MA stage for accessing the memory and the multiplier is added to the LDS instruction, as 
described later. When the MA of the LDS instruction contends with the operating multiplier 
(mm), the MA is extended until the mm ends (the M—A shown in the dotted line box in 
figure 7.73) to create a single slot. The MA of the LDS contends with IF. Figure 7.73 
illustrates how this occurs, assuming MA and IF contention. 


<> +> +> + + —___? 2 @ @& @» Slot 
DMULS.L IF | 
LDS.L 
Other instruction 
Other instruction 
Other instruction , IF ID EX 





eee err nest eeerer ere seeeeee 


DMULS.L if 
LDS.L 
Other instruction 
Other instruction 
Other instruction 


Bere or rere essere seesecese 


pee een ee eeeerereersereece 


CO 





Figure 7.73 LDS.L (Memory) Instruction Immediately After a DMULS.L Instruction 
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7.7.3 Logic Operation Instructions 


Register-Register Logic Operation Instructions: Include the following instruction types: 


e AND Rm, Rn 
¢ AND #imm, RO 
e NOT Rm, Rn 
¢ OR’ Rm,Rn 
¢ OR #imm, RO 
¢ TST Rm,Rn 
¢ TST #imm, RO 
¢ XOR Rm,Rn 
« XOR #imm, RO 


> +> >? @&? &> @ : Siot 
InstructionA IF ID EX 
Next instruction 


Third instruction 





Figure 7.74 Register-Register Logic Operation Instruction Pipeline 


Operation: The pipeline has three stages: IF, ID, and EX (figure 8.74). The data operation is 
completed in the EX stage via the ALU. 


221 


Memory Logic Operation Instructions: Include the following instruction types: 


¢ AND.B #imm, @(RO, GBR) 
¢ OR.B #imm, @(RO, GBR) 
¢ TST.B = #imm, @(RO, GBR) 
¢ XOR.B- #imm, @(RO, GBR) 


> + > + +} &Y > |} @: Slot 
Instruction A IF ID EX MA EX MA | 


~ Next instruction 
Third instruction 





Figure 7.75 Memory Logic Operation Instruction Pipeline 


Operation: Operation: The pipeline has six stages: IF, ID, EX, MA, EX, and MA (figure 7.75). 
The ID of the next instruction stalls for 2 slots. The MAs of these instructions contend with IF. 
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TAS Instruction: Includes the following instruction type: 


¢ TAS.B  @Rn 


ear or OD DO HD HD: Slot 
InstructionA IF ID EX MA EX MA 


Next instruction iF — 
Third instruction 





Figure 7.76 TAS Instruction Pipeline 


Operation: The pipeline has six stages: IF, ID, EX, MA, EX, and MA (figure 7.76). The ID of the 
next instruction stalls for 3 slots. The MA of the TAS instruction contends with IF. 
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7.7.4 Shift Instructions 


Shift Instructions: Include the following instruction types: 


¢ ROTL Rn 
¢ ROTR Rn 
¢ ROTCL Rn 
¢ ROTCR’- Rn 
¢ SHAL Rn 
e SHAR Rn 
e SHLL Rn 
¢ SHLR Rn 
¢ SHLL2 ~~ Rn 
¢ SHLR2 Rn 
¢ SHLL8 = Rn 
¢ SHLR8 - Rn 
¢ SHLLI6 Rn 
¢ SHLR1I6 Rn 


>> > > +> o> | +> @} @-: Slo 
InstructionA iF ID EX 
Next instruction IF . 


Third instruction 





Figure 7.77 Shift Instruction Pipeline 


Operation: The pipeline has three stages: IF, ID, and EX (figure 7.77). The data operation is 
completed in the EX stage via the ALU. 
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7.7.5 Branch Instructions 


Conditional Branch Instructions: Include the following instruction types: 


e BF _iabel 
e BT label 


Operation: The pipeline has three stages: IF, ID, and EX. Condition verification is performed in 
the ID stage. Conditional branch instructions are not delayed branch. 


1. When condition is satisfied 


The branch destination address is calculated in the EX stage. The two instructions after the 
conditional branch instruction (instruction A) are fetched but discarded. The branch 
destination instruction begins its fetch from the slot following the slot which has the EX stage 
of instruction A (figure 7.78). 


<> <> +? + + HY YH YH DD: Slot 
Instruction A IF ID EX 
Next instruction IF — (Fetched but discarded) 


Third instruction iF — (Fetched but discarded) 
Branch destination — IF ID EX. cw 





Figure 7.78 Branch Instruction When Condition is Satisfied 
2. When condition is not satisfied 


If it is determined that conditions are not satisfied at the ID stage, the EX stage proceeds 
without doing anything. The next instruction also executes a fetch (figure 7.79). 


+> > + HS +H HH HY DH: Slot 
InstructonA IF ID EX 
Next instruction IF 


Third instruction 





Figure 7.79 Branch Instruction When Condition is Not Satisfied 
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Note: SH-2 always fetches instructions with a long word. Therefore, ''1. When condition is 
satisfied", 2 instructions are overrun when fetched, if that address is at the boundary of the 
4n address. | OO | 
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Delayed Conditional Branch Instructions (SH-2 CPU): Include the following instruction types: 


¢ BF/S label (SH-2 CPU only) 
¢ BT/S label (SH-2 CPU only) 


Operation: The pipeline has three stages: IF, ID, and EX. Condition verification is performed in 
the ID stage. 


1. 


When condition is satisfied 


The branch destination address is calculated in the EX stage. The instruction after the 
conditional branch instruction (instruction A) is fetched and executed, but the instruction after 
that is fetched and discarded. The branch destination instruction begins its fetch from the slot 
following the slot which has the EX stage of instruction A (figure 7.80). 


<> <> <> <+? <)> +> <> +> «= 2? : Slot 


Instruction A IF ID EX 


Next instruction IF — ID EX MA WB 


Third instruction IF — (Fetched but discarded) 
Branch destination IF 





Figure 7.80 Branch Instruction When Condition is Satisfied 


2. When condition is not satisfied 


If it is determined that conditions are not satisfied at the ID stage, the EX stage proceeds 
without doing anything. The next instruction also executes a fetch (figure 7.81). 


> + +> > > + > | @: Slot 
Instruction A IF ID EX 
Next instruction 


Third instruction 





Figure 7.81 Branch Instruction When Condition is Not Satisfied 
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Note: SH-2 always fetches instructions with a long word. Therefore, '1. When condition is 
satisfied", 2 instructions are overrun when fetched, if that address is at the boundary of the 
4n address. | 
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Unconditional Branch Instructions: Include the following instruction types: 


¢ BRA label 

¢ BRAF Rm (SH-2 CPU only) 
¢ BSR label 

¢ BSRF Rm (SH-2 CPU only) 
¢ JMP @Rm 

¢ JSR @Rm 

¢ RTS 


+ > +? oo OO OO OD @&D @ : Slot 
InstructionA IF ID EX 
Delay slot IF — ID EX 


Branch destination 





Figure 7.82 Unconditional Branch Instruction Pipeline 


Operation: The pipeline has three stages: IF, ID, and EX (figure 7.82). Unconditional branch 
instructions are delayed branch. The branch destination address is calculated in the EX stage. The 
instruction following the unconditional branch instruction (instruction A), that is, the delay slot 
instruction is fetched and not discarded as the conditional branch instructions are, but is then 
executed. Note that the ID slot of the delay slot instruction does stall for one cycle. The branch 
destination instruction starts its fetch from the slot after the slot that has the EX stage of 
instruction A. 
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7.7.6 System Control Instructions 


System Control ALU Instructions: Include the following instruction types: 


* CLRT 

- LDC Rm,SR 
- LDC Rm, GBR 
* LDC Rm, VBR 
* LDS Rm,PR 
* NOP 

+ SEIT 

* STC SR,Rn 

- STC GBR, Rn 
* STC VBR,Rn 
* STS PR,Rn 


> > oY > HY HD HD DH: Slo 
InstructionA IF. ID EX 
Next instruction 


Third instruction 





Figure 7.83 System Control ALU Instruction Pipeline 


Operation: The pipeline has three stages: IF, ID, and EX (figure 7.83). The data operation is 
completed in the EX stage via the ALU. 
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LDC.L Instructions: Include the following instruction types: 


e LDC.L @Rm+, SR 
e LDC.L @Rm+, GBR 
¢ LDC.L @Rm+, VBR 


> + +? > +> + + HS] @ : Slot 
Instruction A IF ID EX MA EX] 


Next instruction IF — — ID EX 
Third instruction IF 





Figure 7.84 LDC.L Instruction Pipeline 


Operation: The pipeline has five stages: IF, ID, EX, MA, and EX (figure 7.84). The ID of the 
following instruction is stalled for two slots. | 
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STC.L Instructions: Include the following instruction types: 


e STC.L SR,@-Rn 
¢ STC.L GBR, @-Rn 
¢ STC.L VBR,@-Rn 


<-> > > |} |] <> <>: Siot 
Instruction A IF. ID. EX MAI 


Next instruction iF — 
Third instruction IF ID EX 





Figure 7.85 STC.L Instruction Pipeline 


Operation: The pipeline has four stages: IF, ID, EX, and MA (figure 7.85). The ID of the next 
instruction is stalled for one slot. | 
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LDS.L Instruction (PR): Includes the following instruction type: 


e LDS.L @Rm+, PR 


- > oe 2 + > |] |] @: Slot 
InstructionA IF ID EX MA WB 
Next instruction 


Third instruction 





Figure 7.86 LDS.L Instruction (PR) Pipeline 


Operation: The pipeline has five stages: IF, ID, EX, MA, and WB (figure 7.86). It is the same as 
an ordinary load instruction. 
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STS.L Instruction (PR): Includes the following instruction type: 


¢ STS.LPR, @—Rn 


+> +> +> +> +> <> <> <> OH : Slo 


Instruction A IF ID EX MA 


Next instruction IF 
Third instruction IF ID EX 





Figure 7.87 STS.L Instruction (PR) Pipeline 


Operation: The pipeline has four stages: IF, ID, EX, and MA (figure 7.87). It is the same as an 
ordinary store instruction. | 
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Register — MAC Transfer Instructions: Include the following instruction types: 


¢ CLRMAC 
e LDS Rm, MACH 
e LDS Rm, MACL 


—> <<? > +] > > > |] @-:: Siot 
InstructionA IF ID EX MA 


Nextinstruction IF ID EX. --w-. 
Thirdinstruction JF ID EX. ew. 





Figure 7.88 Register - MAC Transfer Instruction Pipeline 


Operation: The pipeline has four stages: IF, ID, EX, and MA (figure 7.88). The MA is a stage for 
accessing the multiplier. The MA contends with the IF. This makes it the same as ordinary store 
instructions. Since the multiplier contends with the MA, see the section for the SOP instruction, © 
multiply instruction, and double precision multiply instruction. 
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Memory — MAC Transfer Instructions: Include the following instruction types: 


* LDSL @Rm+,MACH 
¢ LDS.L @Rm+, MACL 


> > > + > YD |} @ :: Slot 
InstructionA IF ID EX MA 


Next instruction 
Third instruction 





Figure 7.89 Memory — MAC Transfer Instruction Pipeline 


Operation: The pipeline has four stages: IF, ID, EX, and MA (figure 7.89). The MA contends 
with the IF. The MA is a stage for memory access and multiplier access. This makes it the same as 
ordinary load instructions. Since the multiplier contends with the MA, see the section for the SOP 
instruction, multiply instruction, and double precision multiply instruction. 
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MAC -— Register Transfer Instructions: Include the following instruction types: 


e STS MACH, Rn 
e STS MACL, Rn 


> +> +> + |] > + > @ : Slot 
InstructionA IF ID EX MA WB 
Next instruction 


Third instruction 





Figure 7.90 MAC — Register Transfer Instruction Pipeline 


Operation: The pipeline has five stages: IF, ID, EX, MA, and WB (figure 7.90). The MA is a 
stage for accessing the multiplier. The MA contends with the IF. This makes it the same as 
ordinary load instructions. Since the multiplier contends with the MA, see the section for the SOP 
instruction, multiply instruction, and double precision multiply instruction. 
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MAC —> Memory Transfer Instructions: Include the following instruction types: — 


e STS.L MACH, @-Rn 
e STS.L MACL, @-Rn 


Pr DO DY OD OD @&: Slo 


Instruction AIF ID EX MA 


Next instruction 
Third instruction 





Figure 7.91 MAC > Memory Transfer Instruction Pipeline 


Operation: The pipeline has four stages: IF, ID, EX, and MA (figure 7.91). The MA is a stage for 
accessing the memory and the multiplier. The MA contends with IF. This makes it the same as 
ordinary store instructions. Since the multiplier contends with the MA, see the section for the SOP 
instruction, multiply instruction, and double precision multiply instruction. 
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RTE Instruction: Includes the following instruction type: 


e RTE 


> > + or oY YD HD DH @&: Slt 
RTE IF ID EX MA MA 
Delay slot — 


Branch destination 





Figure 7.92 RTE Instruction Pipeline 


The pipeline has five stages: IF, ID, EX, MA, and MA (figure 7.92). The MAs contend with the 
IF. The RTE is a delayed branch instruction. The ID of the delay slot instruction is stalled for 3 


slots. The IF of the branch destination instruction starts from the slot following the MA of the 
RTE. | 
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TRAP Instruction: Includes the following instruction type: 


e TRAPA #imm 


<> <> <> <> <> «o> «> o> o> o>? D> @ -@ : Slot 
TRAPA IF ID EX EX MA MA MA EX EX 
Next instruction 


Third instruction IF 
Branch destination IF ID EX 
IF ID EX 





Figure 7.93 TRAP Instruction Pipeline 


The pipeline has nine stages: IF, ID, EX, EX, MA, MA, MA, EX, and EX (figure 7.93). The MAs 
contend with the IF. The TRAP is not a delayed branch instruction. The two instructions after the 
TRAP instruction are fetched, but they are discarded without being executed. The IF of the branch 
destination instruction starts from the slot of the EX in the ninth stage of the TRAP instruction. 


240 


SLEEP Instruction: Includes the following instruction type: 


e SLEEP 


> or oD OO OD @: Slot 
SLEEP _IF_ ID EX 


Next instruction IF 





Figure 7.94 SLEEP Instruction Pipeline 


Operation: The pipeline has three stages: IF, ID and EX (figure 7.94). It is issued until the IF of 
the next instruction. After the SLEEP instruction is executed, the CPU enters sleep mode or 
standby mode. 
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7.7.7 Exception Processing 


Interrupt Exception Processing: Includes the following instruction type: 


e Interrupt exception processing 


Next instruction 


| Branch destination 





Figure 7.95 Interrupt Exception Processing Pipeline 


Operation: The interrupt is received during the ID stage of the instruction and everything after the 
ID stage is replaced by the interrupt exception processing sequence. The pipeline has ten stages: 
IF, ID, EX, EX, MA, MA, EX, MA, EX, and EX (figure 7.95). Interrupt exception processing is 
not a delayed branch. In interrupt exception processing, an overrun fetch (IF) occurs. In branch 
destination instructions, the IF starts from the slot that has the final EX in the interrupt exception 
processing. 


Interrupt sources are external interrupt request pins such as NMI, user breaks, and on-chip 
peripheral module interrupts. 


242 


Address Error Exception Processing: Includes the following instruction type: 


e Address error exception processing 


Next instruction 
Branch destination 





Figure 7.96 Address Error Exception Processing Pipeline 


Operation: The address error is received during the ID stage of the instruction and everything 
after the ID stage is replaced by the address error exception processing sequence. The pipeline has 
ten stages: IF, ID, EX, EX, MA, MA, EX, MA, EX, and EX (figure 7.96). Address error exception 
processing is not a delayed branch. In address error exception processing, an overrun fetch (IF) 
occurs. In branch destination instructions, the IF starts from the slot that has the final EX in the 
address error exception processing. 


Address errors are caused by instruction fetches and by data reads or writes. For details of the 
error cause, refer to the appropriate hardware manual. 
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Illegal Instruction Exception Processing: Includes the following instruction type: 


e Tllegal instruction exception processing 


> dvo vo OO YOO HD @D: Slot 


Next instruction 
(Third instruction 
Branch destination ID EX 





Figure 7.97 Illegal Instruction Exception Processing Pipeline 


Operation: The illegal instruction is received during the ID stage of the instruction and everything 
after the ID stage is replaced by the illegal instruction exception processing sequence. The pipeline 
has nine stages: IF, ID, EX, EX, MA, MA, MA, EX, and EX (figure 7.97). Illegal instruction 
exception processing is not a delayed branch. In illegal instruction exception processing, an 
overrun fetch (IF) occurs. Whether there is an IF only in the next instruction or in the one after that 
as well depends on the instruction that was to be executed. In branch destination instructions, the 
IF starts from the slot that has the final EX in the illegal instruction exception processing. 


Illegal instruction exception processing is caused by ordinary illegal instructions and by illegal slot 
instructions. When undefined code placed somewhere other than the slot directly after the delayed 
branch instruction (called the delay slot) is decoded, ordinary illegal instruction exception 
processing occurs. When undefined code placed in the delay slot is decoded or when an instruction 
placed in the delay slot to rewrite the program counter is decoded, an illegal slot instruction 
exception handling occurs. 
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Appendix A_ Instruction Code 
See “6. Instruction Descriptions” for details. 


A.1 Instruction Set by Addressing Mode 


Table A.1 lists instruction codes and execution states by addressing modes. 
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Table A.1 


Instruction Set by Addressing Mode 
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Types 
Addressing Mode Category | Sample Instruction SH-2 SH-1 
No operand — NOP 8 8 
Direct register addressing Destination operand only -MOVT Rn 18 17 
Source and destination ADD Rm, Rn 34 31 
operand 
Load and store with control LDC Rm, SR 12 12 
register or system register sts MACH, Rn 
Indirect register Source operand only JMP @Rm 2 2 
addressing Destination operand only TAS.B @Rn 1 1 
Data transfer with direct MOV.L Rm, @Rn 6 6 
register addressing 
Post increment indirect Multiply/accumulate operation MAC.W @Rm+, @Rn+ 2 1 
register addressing 
Data transfer from direct MOV.L @Rm+,Rn 3 3 
register addressing 
Load to control register or LDC.L @Rm+,SR 6 6 
system register 
Pre decrement indirect Data transfer from direct MOV.L Rm, @-Rn 3 3 
register addressing register addressing 
Store from control register or STC.L SR, ¢-Rn 6 6 
system register | 
Indirect register addressing Data transfer with direct MOV.L Rm, @(disp, Rn) 6 6 
with displacement register addressing 
Indirect indexed register Data transfer with direct MOV.L Rm, @(RO,Rn) 6 6 
addressing register addressing 
Indirect GBR addressing Data transfer with direct MOV.L  R,@(disp,GBR) 6 6 
with displacement register addressing 
Indirect indexed GBR Immediate data transfer AND .B #imm, @(RO,GBR) 4 4 
addressing 
PC relative addressing with Data transfer to directregister MOV.L @(disp,PC),Rn 3 3 
~ displacement addressing | 
PC relative addressing with —_ Branch instruction BRAF Rm 2 0 
Rm 
PC relative addressing Branch instruction BRA label 
Immediate addressing Arithmetic logical operations ADD #imm, Rn 7 
with direct register addressing 
Specify exception processing TRAPA #imm 1 1 
vector 
Total: 142 133 





A.1.1 No Operand 
Table A.2) No Operand 


Instruction Code Operation State TBit 
CLRT 0000000000001000 0>T 1 0 
CLRMAC 0000000000101000 0 — MACH, MACL 1 — 
DIVOU 0000000000011001 0 > M/Q/T 1 0 
NOP 0000000000001001 No operation 1 — 
RTE 0000000000101011 Delayed branch, Stack area 4 LSB 
— PC/SR 
RTS ~ 0000000000001011 Delayed branch,PR->PC 2 — 
SETT 0000000000011000 13 T 1 1 
SLEEP 0000000000011011 Sleep 3 — 
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A.1.2 Direct Register Addressing 


Table A.3 Destination Operand Only 
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Instruction Code Operation State T Bit | 
CMP/PL Rn 0100nnnn00010101 Rn>0,1—>T 1 Comparison result 
CMP/PZ Rn 0100nnnn00010001 Rn2=0,1—-T 1 Comparison result 
DT Rn* 0100nnnn00010000 +=Rn-1-—Rn 1 Comparison result 

When Rnis0O,1—>T, © 
when Rn is nonzero, 
0-T 
MOVT Rn 0000nnnn00101001 7T-—-Rn 1 _— 
ROTL Rn 0100nnnn00000100 T «Rn — MSB 1 MSB 
ROTR Rn 0100nnnn00000101 LSB > Rn > T 1 LSB 
ROTCL Rn 0100nnnn00100100  T«RneT 1 MSB 
ROTCR Rn 0100nnnn00100101 T—4Rn->T 1 LSB 
SHAL Rn 0100nnnn00100000 Te¢RneO 1 MSB 
SHAR Rn 0100nnnn00100001 + MSB—Rn—>T 1 LSB 
SHLL Rn 0100nnnn00000000 T¢Rn¢e-O 1 MSB 
SHLR Rn 0100nnnn00000001 O-Rn->T 1 LSB 
SHLL2 Rn 0100nnnn00001000 Rn<<2-— Rn 1 — 
SHLR2 Rn 0100nnnn00001001 $Rn>>2—- Rn 1 — 
SHLL8 Rn 0100nnnn00011000 #Rn<<8 — Rn 1 — 
SHLR8 Rn 0100nnnn00011001 #Rn>>8— Rn 1 — 
SHLL16 ~—— Rn 0100nnnn00101000 MRn<<16 — Rn 1 — 
SHLR16 Rn 0100nnnn00101001 #Rn>>16 > Rn 1 — 
Note: SH-2 CPU instruction 
- Table A.4 Source and Destination Operand 
Instruction Code Operation State TBit 
ADD Rm,Rn 0011nnnnmmmm1100 Rn +Rm-— Rn 1 — 
| ADDC Rm,Rn 0011nnnnmmmm1110 Rn +Rm+T— Rn, 1 Carry 
carry > T 
ADDV Rm,Rn 0011nnnnmmmm1111 Rn + Rm -—- Rn, 1 Overflow 
| overflow > T 
AND Rm,Rn 0010nnnnmmmm1001 Rn & Rm — Rn 1 — 


Table A.4 Source and Destination Operand (cont) 


Instruction 
CMP/EQ kRm,Rn 
CMP/HS Rm,Rn 
CMP/GE Rm,Rn 
CMP/HI Rm,Rn 
CMP/GT  Rm,Rn 
CMP/STR Rm,Rn 
DIV1 Rm, Rn 
DIVOS Rm, Rn 
DMULS.L Rm, Rn*? 
DMULU.L Rm, Rn*? 
EXTS.B  Rm,Rn 
EXTS.W  Rm,Rn 
EXTU.B  Rm,Rn 
EXTU.W  Rm,Rn 
MOV Rm, Rn 
MUL. L Rm, Rn*? 
MULS.W  Rm,Rn 
MULU.W  Rm,Rn 
NEG Rm, Rn 
NEGC Rm, Rn 
Notes: 


Code 
0011nnnnmmmm0000 


0011 nnnnmmmm0010 


0011nnnnmmmm0011 


0011nnnnmmmm0110 


0011nnnnmmmm0111 


0010nnnnmmmm1100 


0011nnnnmmmm0100 


0010nnnnmmmm0111 


0011innnnmmmm1101 
0011nnnnmmmm0101 
0110nnnnmmmm1110 
0110nnnnmmmm1111 
0110nnnnmmmm1100 
0110nnnnmmmm1101 


0110nnnnmmmm0011 
0000nnnnmmmm0111 
0010nnnnmmmm1111 


0010nnnnmmmm1110 


0110nnnnmmmm1011 
0110nnnnmmmm1010 


Operation 
When Rn = Rm, 1 > T 


When unsigned and Rn 
>Rm, 1 > T 


When signed and Rn = 
Rm, 1—-T 


When unsigned and Rn 
>Rm,1>T 


When signed and Rn > 
Rm, 13> T 


When a byte in Rn 
equals bytes in Rm, 1 
—>T 


1-step division (Rn + 
Rm) 


MSB of Rn > Q, MSB 
of Rm 4>M,M4Q>5T 


Signed, Rn x Rm > 
MACH, MACL 


Unsigned, Rn x Rm > 
MACH, MACL 


Sign — extends Rm 
from byte — Rn 


Sign — extends Rm 
from word — Rn 


Zero — extends Rm 
from byte — Rn 


Zero — extends Rm 
from word > Rn 


Rm — Rn 
Rn x Rm + MACL 


Signed, Rn x Rm 
MAC 


Unsigned, Rn x Rm > 
MAC 


0—-Rm— Rn 


0-Rm-T-— Rh, 
Borrow > T 


1. The normal minimum number of execution states 
2. SH-2 CPU instruction 
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State 
1 


2 to 4*! 


2 to 4*1 


1 
2 to 4*1 
1 to 3*1 


1 to 3*! 


T Bit 


Comparison 
result 


Comparison 
result 


Comparison 
result 


Comparison 
result 


Comparison 
result 


Comparison 
result 


Calculation 
result 


Calculation 
result 


Borrow 


Table A.4 Source and Destination Operand (cont) 








Instruction _ ‘Code Operation State TBit 
NOT Rm,Rn 0110nnnnmmmm0111 # ~Rm-—>Rn 1 = 
OR Rm,Rn — 0010nnnnmmmm1011 Rn | Rm — Rn 1 — 
SUB Rm,Rn 0011innnnmmmmi000 = Rn—Rm—Rn 1 — 
SUBC Rm,Rn 0011nnnnmmmm1010 Rn-—Rm-T-—->Rpo, | 1 Borrow 
7 Borrow > T 
SUBV Rm, Rn —‘0011nnnnmmmm1011 Rn—Rm-— Rn, 1 Underflow 
Underflow > T | 
SWAP.B  Rm,Rn 0110nnnnmmmm1000 -Rm—Swapupperand 1 _ 
lower halves of lower 2 
bytes — Rn 
SWAP.W  Rm,Rn 0110nnnnmmmm1001 Rm-— Swap upperand 1 — 
lower word — Rn 
TST Rm,Rn  0010nnnnmmmm1000 Rn & Rm, when resultis 1 Test results 
| 0,1 >T 
XOR Rm,Rn 0010nnnnmmmm1010 Rn “Rm — Rn 1 — 
XTRCT Rm,Rn 0010nnnnmmmm1101 Center 32 bits of Rmand 1 — 
-Rn—- Rn 
Table A.5 Load and Store with Control Register or System Register 
Instruction Code Operation State T Bit 
LDC Rm, SR 0100mmmm00001110 Rm —- SR 1 LSB 
LDC Rm, GBR 0100mmmm00011110 Rm — GBR 1 — 
LDC Rm, VBR 0100mmmm00101110 Rm — VBR 1 — 
LDS Rm, MACH 0100mmmm00001010 Rm — MACH 1 ~~ 
LDS Rm ,MACL 0100mmmm00011010. Rm — MACL 1 — 
LDS Rm, PR 0100mmmm00101010 Rm — PR 1 — 
STC SR,Rn 0000nnnn00000010 SR — Rn 1 — 
STC GBR, Rn 0000nnnn00010010 GBR > Rn 1 ~— 
STC VBR, Rn 0000nnnn00100010 VBR > Rn 1 — 
STS MACH , Rn 0000nnnn00001010 MACH — Rn 1 — 
STS MACL, Rn 0000nnnn00011010 MACL — Rn 1 — 
STS PR, Rn 0000nnnn00101010 PR -— Rn 1 — 
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A.1.3 


Table A.6 Destination Operand Only 


Indirect Register Addressing 


Instruction Code Operation State T Bit 

JMP @Rm 0100mmmm00101011 Delayed branch, Rm — PC 2 — 

JSR @Rm 0100mmmm00001011 Delayed branch, PC > PR, 2 — 
Rm — PC 

TAS.B @Rn 0100nnnn00011011 When (Rn) is 0, 1 > T, 1 > 4 Test results 
MSB of (Rn) 


Table A.7_ Data Transfer with Direct Register Addressing 


Instruction 
MOV.B_ Rm, @Rn 
MOV.W_ Rm, @Rn 
MOV.L Rm, @Rn 
-MOV.B @Rm,Rn 
MOV.W  @Rm,Rn 
MOV.L @Rm,Rn 


Code —_ 

0010nnnnmmmm0000 
0010nnnnmmmm0001 
0010nnnnmmmm0010 
011 0nnnnmmmm0000 
0110nnnnmmmm0001 
0110nnnnmmmm0010 


Operation State T Bit 
Rm —> (Rn) 1 — 
Rm — (Rn) 1 — 
Rm — (Rn) 1 — 
(Rm) — sign extension > Rn 1 — 
(Rm) — sign extension — Rn 1 — 
(Rm) — Rn 1 — 


A.1.4 Post Increment Indirect Register Addressing 


Table A.8 Miultiply/Accumulate Operation | 


Instruction Code 

MAC.L @Rm+,@Rn+*2 QOO0Onnnnmmmm1111 
MAC.W @Rmt+, @Rn+ 0100nnnnmmmm1111 
Notes: 


Operation State T Bit 
Signed, (Rn) x(Rm)+MAC %eto4*! — 
— MAC 

Signed, (Rn) x (Rm) + MAC — 3/(2)*1 — 

— MAC 


1. The normal minimum number of execution states (The number in parentheses is the 
number of states when there is contention with preceding/following instructions). 


2. SH-2 CPU instruction 
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Table A.9 Data Transfer from Direct Register Addressing 


Instruction Code Operation _—_T . $tate TBit 
MOV.B @Rm+,Rn 0110nnnnmmmm0100 (Rm) — sign extension > 1 — 
oe 7 Rn, Rm +1—> Rm : oe 
MOV.W  @Rm+,Rn 0110nnnnmmmm0101 (Rm) > sign extension > 1 — 
~ Rn, Rm+2—> Rm 
MOV.L @Rm+,Rn 0110nnnnmmmm0110  (Rm)—>Rn,Rm+4—> Rm 1 — 


Table A.10 Load to Control Register or System Register _ 


Instruction : Code Operation. 3 State TBit 
LDC.L @Rm+,SR 0100mmmm00000111 (Rm)—SR,Rm+4—>Rm 3 LSB 
LDC.L @Rm+,GBR  0100mmmm00010111. + +=(Rm)—+GBR,Rm+4>Rm_~ 3 — 
LDC.L @Rm+,VBR  0100mmmm00100111 +=(Rm)>VBR,Rm+45Rm 3 - 
LDS.L @Rm+,MACH  0100mmmm00000110 (Rm) > MACH, Rm+4—Rm 1 — 
LDS.L @Rm+,MACL 0100mmmm00010110 (Rm) — MACL, Rm + 4 a Rm 1 — 
LDS.L @Rm+,PR  0100mmmm00100110 = (Rm) > PR, Rm+4— Rm 1 = 





A.1.5 Pre Decrement Indirect Register Addressing 
Table A.11 Data Transfer from Direct Register Addressing 


Instruction Code Operation State T Bit 


MOV.B- Rm, @—Rn 0010nnnnmmmm0100 Rn-1-— Rn, Rm > (Rn) 1 ae 
MOV.W Rm, @—Rn ~0010nnnnmmmm0101 Rn-2-> Rn, Rm (Rn) 1 — 
MOV.L Rm, @—Rn 0010nnnnmmmm0110 - Rn—4-—> Rn, Rm —- (Rn) 1 — 
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Table A.12 Store from Control Register or System Register 


Instruction 

STC.L SR, @-Rn 
STC.L GBR, @-Rn 
STC.L VBR, @-Rn 
STS.L MACH, @—Rn 
STS.L MACL, @—Rn 
STS.L PR, @-Rn 
A.1.6 


Code 

0100nnnn00000011 
0100nnnn00010011 
0100nnnn00100011 
0100nnnn00000010 
0100nnnn00010010 
0100nnnn00100010 


Operation 

Rn - 4 > Rn, SR — (Rn) 

Rn — 4 > Rn, GBR —> (Rn) 
Rn - 4 > Rn, VBR => (Rn) 
Rn — 4 > Rn, MACH -> (Rn) 
Rn — 4 — Rn, MACL -> (Rn) 
Rn — 4 > Rn, PR — (Rn) 


Indirect Register Addressing with Displacement 


Table A.13 Indirect Register Addressing with Displacement 


Instruction 


MOV .B 
MOV .W 
MOV .L 
MOV .B 


MOV .W 


MOV.L 


A.1.7 


RO, @(disp, Rn) 
RO, @(disp, Rn) 
Rm, @(disp, Rn) 
@( disp, Rm), RO 


@(disp, Rm) ,R0O 


@(disp, Rm) ,Rn 


Indirect Indexed Register Addressing 


Code 


Operation 


10000000nnnndddd RO — (disp + Rn) 
10000001nnnndddd RO — (disp x 2 + Rn) 
0001nnnnmmmmdddd Rm — (disp x 4 + Rn) 
10000100mmmmdddd (disp + Rm) —> sign 


extension > RO 


10000101mmmmddadd (disp x2 + Rm) > sign 


extension — RO 


0101nnnnmmmmdddd (disp x4+Rm)—> Rn 


Table A.14 Indirect Indexed Register Addressing 


Instruction 

MOV.B- Rm, @(RO,Rn) 
MOV.W- Rm, @(RO,Rn) 
MOV.L Rm, @(RO, Rn) 
MOV.B @(RO,Rm),Rn 
MOV.W @(RO,Rm),Rn 
MOV. L 


@(RO,Rm),Rn 


Code 

0000nnnnmmmm0100 
0000nnnnmmmm0101 
0000nnnnmmmm0110 
0000nnnnmmmm1100 


0000nnnnmmmm1101 


0000nnnnmmmm1110 


Operation 

Rm — (RO + Rn) 
Rm — (RO + Rn) 
Rm —> (RO + Rn) 


(RO + Rm) > sign 
extension > Rn 


(RO + Rm) > sign 
extension > Rn 


(RO + Rm) > Rn 
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State 


2 
2 
2 
1 
1 
1 


State 


ee 


T Bit 


T Bit 


T Bit 


A.1.8 


Indirect GBR Addressing with Displacement 


Table A.15 Indirect GBR Addressing with Displacement 
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TBit 





Instruction Code Operation State 

MOV.B- RO, @(disp,GBR) 11000000dddddddd RO-(disp+GBR) 1 — 

MOV.W_ RO, @(disp, GBR) 11000001dddddddd RO — (disp x 2 + 1 — 

. _ GBR) 
MOV.L RO, @(disp,GBR) 11000010dddddddd RO — (disp x 4+ 1 — 
| , GBR) 

MOV.B @(disp,GBR),RO 11000100ddddddda_ (disp + GBR) —> sign 1 — 
extension — RO 

MOV.W @(disp,GBR),RO . 11000101dddddddd (disp x2+GBR)—> 1 — 
sign extension — RO | 

MOV.L @(disp,GBR),RO 11000110dddddddd_ (disp x4+GBR)—> 1 —_ 
RO 

A.1.9 Indirect Indexed GBR Addressing 

Table A.16 Indirect Indexed GBR Addressing 

Instruction Code — Operation State TBit 

AND.B #imm,@(RO,GBR) 11001101iiiiiiii (RO + GBR) & imm > 3 — 

; | (RO + GBR) 
OR.B  #imm, @(RO,GBR) 110011114iiii1ii (RO + GBR) |limm— (RO 3 — 
| + GBR) 
TST.B #imm, @(RO,GBR) 11001100iiiiiii11 (RO + GBR) &imm, when 3 Test 
result is 0, 1 — T | results 
XOR.B #imm,@(RO,GBR)  11001110iiiiiiii1 (RO+GBR)“4imm-— (RO 3 = 
7 + GBR) | 

A.1.10 PC Relative Addressing with Displacement 

Table A.17 PC Relative Addressing with Displacement 

Instruction Code Operation State TBit 

MOV.W @(disp,PC),Rn  1001nnnndddddddd (disp x2+PC)— sign 1 — 

extension — Rn 
MOV.L @(disp,PC),Rn  1101nnnndddddddd (disp x 4+PC) — Rn 1 — 
MOVA @(disp,PC),RO 11000111ddddddda disp x4+PC— RO 1 — 





A.1.11_ PC Relative Addressing with Rm 
Table A.18 PC Relative Addressing with Rm 


Instruction Code Operation State TBit 

BRAF Rm*? 0000mmmm00100011 Delayed branch, Rm + PC > PC 2 — 

BSRF  Rm*? 0000mmmm00000011 Delayed branch, PC + PR,Rm+PC 2 — 
— PC 


Notes: 2. SH-2 CPU instruction 


A.1.12 PC Relative Addressing 
Table A.19 PC Relative Addressing 


Instruction Code Operation | State  T Bit 

BF — label 10001011dddddddd WhenT=0,dispx2+PC—>PC; 3/1*% — 
When T = 1, nop 7 

BF/S label*? 10001111dddddddd When T =0, disp x2+PC—+PC; 2/1*9 — 
When T = 1, nop | 

BT label 10001001dddddddd WhenT=1,dispx2+PC—+PC; 3/1 — 
When T = 0, nop 

BT/S label*? 10001101dddddddd When T = 1, disp x2+PC—4PC; 2/1*° — 
When T = 0, nop 

BRA label 1010dddddddddddd Delayed branch, dispx2+PC—> 2 — 
PC 

BSR label 1011dddddddddddd Delayed branch, PC > PR, dispx 2 — 
2+PC—+PC 


Notes: 2. SH-2 CPU instruction 
3. One state when it does not branch 
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A.1.13 Immediate 


Table A.20 Arithmetic Logical Operation with Direct Register Addressing 





Instruction Code Operation State T Bit 

ADD #imm,Rn 0O11innnniiiiiiii Rn + imm > Rn 1 — 

AND #imm,RO 11001001iiiiiiii RO & imm — RO 1 — 

CMP/EQ #imm,RO 10001000iiiiiiii | WhenRO=imm,1—3T 1 Comparison 

result 

MOV #imm,Rn 1110nnnniiiiiiii imm — sign extension — Rn 1 — 

OR #imm,RO 11001011iiiiiiii RO | imm — RO 1 — 

TST #imm,RO 11001000iiiiiiii RO &imm, when resultisO, 1 Test results 
1T 

XOR #imm,RO 11001010iiiiiiii RO “imm — RO 1 — 

Table A.21 Specify Exception Processing Vector 

Instruction Code Operation State T Bit 

TRAPA #imm 11000011iiiiiiii PC/SR — Stack area, (mmx4+ 8 — 

VBR) —>.PC 
A.2 Instruction Sets by Instruction Format 


Tables A.22 to A.48 list instruction codes and execution states by instruction formats. 
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Table A.22 Instruction Sets by Format 


Format Category 


0 
n 


nm 


md 


nd4 


nmd 


d12 
nd8& 


ni 


Direct register addressing 


Direct register addressing (store with control 
or system registers) 


Indirect register addressing 
Pre decrement indirect register addressing 


Direct register addressing (load with control 
or system registers) 


PC relative addressing with Rn 

Direct register addressing 

Post increment indirect register addressing 
Direct register addressing 

Indirect register addressing 


Post increment indirect register addressing 
(multiply/accumulate operation) 


Post increment indirect register addressing 
Pre decrement indirect register addressing 
Indirect indexed register addressing 


Indirect register addressing with 
displacement 


Indirect register addressing with 
displacement 


Indirect register addressing with 
displacement 


Indirect GBR addressing with displacement 
Indirect PC addressing with displacement 
PC relative addressing 

PC relative addressing 

PC relative addressing with displacement 
Indirect indexed GBR addressing 


Immediate addressing (arithmetic and logical 


operations with direct register) 


Immediate addressing (specify exception 
processing vector) 


Immediate addressing (direct register 
arithmetic operations and data transfers ) 
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Sample Instruction 


NOP 
MOVT 
STS 


TAS .B 
STC.L 
LDC 


; 
5 


MOV. 
MOV. 
MOV. 
MOV. 


Dirirye 


MOV .B 


MOV .L 


MOV. L 
MOVA 
BF 
BRA 
MOV.L 
AND .B 
AND 


TRAPA 


ADD 


Rn 
MACH , Rn 


@Rn 
SR, @—Rn 
Rm, SR 


Rm 

@Rm 
@Rm+,SR 
Rm, Rn 

Rm, @Rn 
@Rm+, @Rn+ 


@Rm+, Rn 

Rm, @—Rn 

Rm, @(RO, Rn) 
@(disp,Rm),RO 


RO, @(disp, Rn) 
Rm, @(disp, Rn) 


RO, @(disp, GBR) 
@(disp,PC),RO 
label 

label 
@(disp,PC),Rn 
#imm, @(RO,GBR) 
#imm, RO 


#imm 


#imm, Rn 


Total: 


Types 
SH-2 SH-1 
8 8 
18 17 
6 6 
1 1 
6 6 
6 6 
2 0 
2 2 
6 6 
34 31 
6 6 
2 1 
3 3 
3 3 
6 6 
2 2 
2 2 
2 2 
6 6 
1 1 
4 2 
2 2 
2 2 
4 4 
5 5 
| | 
2 2 
142 133 


A.2.1 0 Format 
Table A.23 0 Format 





Instruction = Code | Operation State T Bit 
CLRT 0000000000001000 0O->T 1 0 
CLRMAC 0000000000101000 0 — MACH, MACL 1 "Sape 
DIVOU 0000000000011001 0 > M/Q/T 1 0 
NOP 0000000000001001 No operation 1 — 
RTE | 0000000000101011 Delayed branching, stack 4 LSB 
7 area > PC/SR — 
RTS | 0000000000001011 Delayed branching, PR> 2 a 
PC | 

SETT 0000000000011000 1>T | 1. 1 
SLEEP _ 0000000000011011 Sleep of 


Notes: 4. This is the number of states until a transition is made to the Sleep state. 
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A.2.2 


n Format 


Table A.24 Direct Register Addressing 


Instruction 


CMP/PL Rn 


CMP /PZ 
DT 


MOVT 
ROTL 
ROTR 
ROTCL 
ROTCR 
SHAL 
SHAR 
SHLL 
SHLR 
SHLL2 
SHLR2 
SHLL8 
SHLR8 
SHLL16 
SHLR16 
Notes: 


Rn 


Rn*2 


S\8i8 isis /s/s|sF\|F1F/F1/81F\18 18 


Code 

0100nnnn00010101 
0100nnnn00010001 
0100nnnn00010000 


0000nnnn00101001 
0100nnnn00000100 
0100nnnn00000101 
0100nnnn00100100 
0100nnnn00100101 
0100nnnn00100000 
0100nnnn00100001 
0100nnnn00000000 
0100nnnn00000001 
0100nnnn00001000 
0100nnnn00001001 
0100nnnn00011000 
0100nnnn00011001 
0100nnnn00101000 
0100nnnn00101001 


2. SH-2 CPU instruction. 


Operation 
Rn>0,1—5T 
Rn2=0,1—-> T 


Rn-1-— Rn; 
If Rn is 0, 1 > T, if Rn 
is nonzero, 0 > T 


T— Rn 

T<— Rn — MSB 
LSB — Rn > T 
TeRnewt 
T—>Rn>T 
TeRn<O 
MSB —> Rn > T 
TeRn<O0 
0O—>Rn->T 
Rn<<2 — Rn 
Rn>>2 — Rn 
Rn<<8 — Rn 
Rn>>8 — Rn 
Rn<<16 — Rn 


Rn>>16 — Rn 


State 


ee ee ee ee ee ee ee 


T Bit 


Comparison result 
Comparison result 


Comparison result 


MSB 
LSB 
MSB 
LSB 
MSB 
LSB 
MSB 
LSB 


Table A.25 Direct Register Addressing (Store with Control and System Registers) 





Instruction Code Operation State T Bit 
STC SR,Rn 0000nnnn00000010 SR —> Rn 1 — 
STC GBR, Rn 0000nnnn00010010 GBR —> Rn 1 — 
STC VBR, Rn 0000nnnn00100010 VBR — Rn 1 — 
STS MACH, Rn 0000nnnn00001010 MACH — Rn { — 
STS MACL, Rn 0000nnnn00011010 MACL — Rn 1 — 
STS PR, Rn 0000nnnn00101010 PR — Rn 1 — 
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Table A.26 Indirect Register Addressing 


Instruction Code 


TAS.B @Rn 


0100nnnn00011011 


Operation | _ State 


When (Rn) is 0, 1 > T, 1 > 4 


MSB of (Rn) — 


Table A.27 Pre Decrement Indirect Register 


T Bit 


Test results 





Instruction Code Operation State T Bit 
STC.L SR,@-Rn 0100nnnn00000011 Rn — 4 > Rn, SR = (Rn) 2 — 
STC.L GBR, @-Rn 0100nnnn00010011 Rn-4-— Rn, GBR—- (Rn) 2 — 
STC.L VBR, @-Rn 0100nnnn00100011 Rn-—-4-— Rn, VBR-(Rn) 2 — 
STS.L MACH, @-Rn 0100nnnn00000010 Rn —4-— Rn, MACH - (Rn) 1 — 
STS.L MACL, @-Rn 0100nnnn00010010 Rn—4- Rn, MACL > (Rn) 1 — 
STS.L PR, @—-Rn 0100nnnn00100010 Rn — 4 — Rn, PR —> (Rn) 1 = 
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A.2.3. m Format 
Table A.28 Direct Register Addressing (Load with Control and System Registers) 


Instruction Code | Operation State T Bit 
LDC Rm, SR 0100mmmm00001110 Rm —> SR 1 LSB 
LDC Rm, GBR 0100mmmm00011110 Rm — GBR 1 — 
LDC Rm, VBR 0100mmmm00101110 Rm — VBR 1 — 
LDS Rm, MACH 0100mmmm00001010 Rm — MACH 1 — 
LDS Rm,MACL 0100mmmm00011010 Rm — MACL 1 —_ 
LDS Rm, PR 0100mmmm00101010 Rm — PR 1 — 


Table A.29 Indirect Register 


Instruction Code Operation State T Bit 

JMP @Rm  0100mmmm00101011 Delayed branch, Rm — PC 2 — 

JSR = @Rm 0100mmmm00001011 ##Delayed branch, PC — PR, 2 — 
Rm — PC 


Table A.30 Post Increment Indirect Register 


Instruction Code Operation State T Bit 
LDC.L @Rm+,SR 0100mmmm00000111 (Rm) > SR, Rm + 4 —- Rm 3 LSB 
LDC.L @Rm+,GBR  0100mmmm00010111 (Rm) > GBR,Rm+4—>5Rm_— 3 — 
LDC.L @Rm+,VBR 0100mmmm00100111 (Rm) > VBR, Rm + 4 > Rm 3 — 
LDS.L @Rm+,MACH 0100mmmm00000110 (Rm) - MACH, Rm+4— Rm 1 — 
LDS.L @Rm+,MACL 0100mmmm00010110 (Rm) — MACL, Rm+4-—Rm = 1 — 
LDS.L @Rmt+,PR 0100mmmm00100110 (Rm) — PR, Rm + 4 —- Rm 1 — 


Table A.31 PC Relative Addressing with Rm 


Instruction Code Operation State T Bit 
BRAF  Rm*?2 0000mmmm00100011 Delayed branch, Rm + PC -— PC 2 — 


BSRF  Rm*? 0000mmmm00000011 Delayed branch, PC > PR,Rm+PC 2 — 
— PC : 


Notes: 2. SH-2 CPU instruction 
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A.2.4 


nm Format 


Table A.32 Direct Register Addressing 


Instruction 


ADD 
ADDC 


ADDV 


AND 
CMP/EQ 


CMP/HS 
CMP/GE 
CMP /HI 
CMP/GT 
CMP/STR 
DIV1 
DIVOS 
DMULS . L 
DMULU .L 
EXTS .B 
EXTS .W 
EXTU.B 
EXTU .W 


MOV 
Notes: 


1. 
2 


Rm, Rn 
Rm,Rn — 


Rm, Rn 


Rm, Rn 
Rm, Rn 


Rm, Rn 


Rm, Rn 


Rm, Rn 


Rm, Rn 
Rm, Rn 
Rm, Rn 
Rm, Rn 
Rm, Rn*? 
Rm, Rn*? 
Rm, Rn 
Rm, Rn 
Rm, Rn 
Rm, Rn 


Rm, Rn 


Code | 
0011nnnnmmmm1100 
0011nnnnmmmn1110 


0011nnnnmmmmi111 


0010nnnnmmmm1001 
0011nnnnmmmm0000 


0011 nnnnmmmm0010 


0011nnnnmmmm0011 


0011nnnnmmmm0110 | 


0011nnnnmmmm0111 


0010nnnnmmmm1100 


0011nnnnmmmm0100 


0010nnnnmmmm0111 


001innnnmmmm1101, 


0011nnnnmmmm0101 
0110nnnnmmmm1110 
0110nnnnmmmm1111 
0110nnnnmmmm1100 
0110nnnnmmmm1101 


0110nnnnmmmm0011 


Operation 
Rn +Rm-— Rn 


Rn + Rm+T — Rn, carry 
oT 


Rn + Rm —> Rn, overflow 
+T 


Rn & Rm — Rn 
When Rn = Rm, 1 > T 


When unsigned and Rn = 
Rm, 13 T 


When signed and Rn = 
Rm, 1 35 T 


When unsigned and Rn > 
Rm, 17> T 


When signed and Rn > 


Rm, 173 T 


When a byte in Rn equals 
a byte in Rm, 1 > T 


1-step division (Rn + Rm) 


-MSB of Rn > Q, MSB of 


Rm>M,M4Q>T> 


Signed, Rn x Rm > 
MACH, MACL 


Unsigned, Rn x Rm > 
MACH, MACL 


Sign-extends Rm from 
byte — Rn 


Sign-extends Rm from 
word — Rn 


Zero-extends Rm from 
byte — Rn 


Zero-extends Rm from : 
word —> Rn 


Rm —> Rn 


The normal minimum number of execution states 
SH-2 CPU instruction 
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State 
1 
1 


2 to 4*1 


2 to 4*1 


T Bit 


Carry 


Overflow 


~ Comparison 


result 


Comparison 
result 


Comparison 
result 


Comparison 
result 


Comparison 
result 


Comparison 
result 


~ Calculation 


result 


Calculation 
result 


Table A.32 Direct Register Addressing (cont) 


Instruction Code Operation State TBit 

MUL.L Rm,Rn*? 0000nnnnmmmm0111 Rn x Rm > MACL 2to4*t — 

MULS.W Rm,Rn 0010nnnnmmmm1111 Signed, RnxRm—MAC 1to3*! — 

MULU.W Rm,Rn 0010nnnnmmmm1110 Unsigned, Rn x Rm > 1to3*? _ 
MAC 

NEG Rm, Rn 0110nnnnmmmm1011 0—Rm —> Rn 1 — 

NEGC Rm, Rn 0110nnnnmmmm1010 0O—Rm—T-— Rn, borrow 1 Borrow 
+ T 

NOT Rm, Rn 0110nnnnmmmm0111 ~Rm > Rn 1 — 

OR Rm, Rn 0010nnnnmmmm1011 Rn | Rm —- Rn 1 — 

SUB Rm, Rn 0011nnnnmmmm1000 Rn —Rm > Rn | — 

SUBC Rm, Rn 0011nnnnmmmm1010 Rn —-Rm-—-T-—- Rh, 1 Borrow 
borrow > T 

SUBV Rm, Rn 0011nnnnmmmm1011 Rn —Rm — Rn, underflow 1 Underflow 
73 T 

SWAP.B Rm,Rn 0110nnnnmmmm1000 Rm — Swap upper and 1 — 
lower halves of lower 2 
bytes — Rn 

SWAP.W Rm,Rn 0110nnnnmmmm1001 Rm — Swap upper and 1 — 
lower word > Rn 

TST Rm, Rn 0010nnnnmmmm1000 Rn & Rm, when result is | Test results 
0,1 5T 

XOR Rm, Rn 0010nnnnmmmm1010 Rn4Rm—- Rn 1 — 

XTRCT  Rm,Rn 0010nnnnmmmm1101 Center 32 bits of Rm and = 1 — 
Rn — Rn 


Notes: 1. The normal minimum number of execution cycles. 
2. SH-2 CPU instructions 


Table A.33 Indirect Register Addressing 


Instruction Code Operation State T Bit 
MOV.B- Rm, @Rn 0010nnnnmmmm0000 Rm — (Rn) 1 — 
MOV.W Rm, @Rn 0010nnnnmmmm0001 Rm => (Rn) 1 — 
MOV.L Rm, @Rn 0010nnnnmmmm0010 Rm — (Rn) 1 = 
MOV.B @Rm,Rn 0110nnnnmmmm0000 (Rm) — sign extension => Rn 1 = 
MOV.W  @Rm,Rn 0110nnnnmmmm0001 (Rm) > sign extension > Rn 1 = 
MOV.L @Rm,Rn 0110nnnnmmmm0010 (Rm) > Rn | 1 —_ 
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Table A.34 Post Increment Indirect Register (Multiply/Accumulate Operation) 


Instruction Code Operation 
MAC.L @Rm+,@Rn+*?  0000nnnnmmmm1111 Signed, (Rn) x (Rm) + 
| MAC — MAC 
| MAC.W @Rmt+, @Rn+ 0100nnnnmmmm1111 Signed, (Rn) x (Rm) + 
MAC — MAC 


State 


3/(2 to 
4)*1 


3/(2)*! 


T Bit 


Notes: 1. The normal minimum number of execution cycles.(The number in parentheses in the 
number of cycles when there is contention with preceding/following instructions). 
2. SH-2 CPU instruction. 


Table A.35 Post Increment Indirect Register 


instruction 

MOV.B @Rm+,Rn 
MOV.W  @Rm+,Rn 
MOV.L @Rm+,Rn 


Code 
0110nnnnmmmm0100 


0110nnnnmmmm0101 


0110nnnnmmmm0110 


Operation 


(Rm) — sign extension > 
Rn, Rm + 1— Rm 


(Rm) — sign extension > 
Rn, Rm +2 — Rm 


(Rm) > Rn, Rm +4— Rm 


Table A.36 Pre Decrement Indirect Register 


Instruction 

MOV.B Rm, @—-Rn 
MOV.W Rm, @—-Rn 
MOV.L Rm, @—Rn 


Code 

0010nnnnmmmm0100 
0010nnnnmmmm0101 
0010nnnnmmmm0110 


Table A.37 Indirect Indexed Register 


Instruction 

MOV.B Rm, @(RO,Rn) 

MOV.W Rm, @(RO,Rn) 

MOV.L Rm, @(RO,Rn) 
_MOV.B @(RO,Rm),Rn 

MOV.W @(RO,Rm),Rn 

MOV.L @(RO,Rm),Rn 


Code 

0000nnnnmmmm0100 
0000nnnnmmmm0101 
0000nnnnmmmm0110 
0000nnnnmmmm1100 


0000nnnnmmmm1101 


0000nnnnmmmm1110 


Operation 

Rn -—1— Rn, Rm > (Rn) 
Rn — 2 — Rn, Rm — (Rn) 
Rn —4 — Rn, Rm — (Rn) 


Operation 

Rm — (RO + Rn) 
Rm — (RO + Rn) 
Rm — (RO + Rn) 


(RO + Rm) > sign 
extension — Rn 


(RO + Rm) - sign 
extension > Rn 


(RO + Rm) > Rn 
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State 


State 


Cycles 


a ee 


T Bit 


T Bit 


T Bit 


A.2.5 md Format 
Table A.38 md Format 


Instruction 
MOV.B @(disp,Rm),RO 


MOV.W @(disp,Rm),RO 


Code 
10000100mmmmdddd 


10000101mmmmdddd 


Operation 

(disp + Rm) — sign 
extension — RO 
(disp x 2 + Rm) > 
sign extension > 


State 


T Bit 


RO 


A.2.6 nd4 Format 
Table A.39 nd4 Format 


Instruction 
MOV .B RO, @(disp,Rn) 
MOV .W RO, @(disp,Rn) 


A.2.7. mnmd Format 
Table A.40 nmd Format 
Instruction 


MOV.L Rm, @(disp,Rn) 
MOV.L @(disp,Rm),Rn 


Code 
10000000nnnndddd 
10000001nnnndddd 


Code 
0001nnnnmmmmdddd 
0101nnnnmmmmdddd 
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Operation 
RO — (disp + Rn) 
RO — (disp x 2+ Rn) 


Operation 
Rm — (disp x 4 + Rn) 
(disp x 4+ Rm) —> Rn 


State 


State 


T Bit 


T Bit 


A.2.8 d Format 


Table A.41 Indirect GBR with Displacement 


Instruction 
MOV.B- RO, @(disp, GBR) 


MOV.W RO, @(disp, GBR) 


MOV.L RO, @(disp, GBR) 
MOV.B @(disp,GBR) , RO 
MOV.W @(disp,GBR),RO 


MOV.L @(disp,GBR),RO 


Code 


11000000dddddddd 


- 11000001dddddddd 


11000010dddddddd 


11000100dddddddd 


11000101dddddddd 


11000110dddddddd 


Table A.42 PC Relative with Displacement 


Instruction 


MOVA @(disp,PC),RO 


Code 
11000111dddddddd 


Table A.43 PC Relative Addressing 


3. One state when it does not branch 
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Operation 


~ RO — (disp + GBR) 


RO > (disp x2 + 
GBR) 

RO — (disp x 4+ 
GBR) | 


(disp + GBR) > sign 


extension — RO 
(disp x 2 + GBR) > 


sign extension > RO 


(disp x 4 + GBR) > 
RO 


Operation 
disp x 4+ PC — RO 


When T =0, disp x 2+ PC — PC; 
When T = 0, disp x 2 + PC — PC; 
When T = 1, disp x 2+ PC > PC; 


When T = 1, disp x 2 + PC > PC; 


Instruction Code Operation 
BF label 10001011dddddddda 

When T = 1, nop. 
BF/S label**  10001111dddddddd 

When T = 1, nop 

Br label 10001001dddddddd 

When T = 0, nop 
BT/S label**  10001101dddddddd | 

When T = 0, nop 
Notes: 2. SH-2 CPU instruction 


State 


1 
1 


1 


1 


State 


State | 


3/1*3 


an*3 | 


3/173 


a/1*8 


T Bit 


T Bit 


T Bit 


A.2.9 d12 Format 
Table A.44 d12 Format 


Instruction Code Operation State T Bit 
BRA label 1010dddddddddddd Delayed branch, disp x 2+ PC—-PC 2 — 


BSR label 1011dddddddddddd Delayed branching, PC + PR, dispx2 2 — 
+PC— PC 


A.2.10 nd8 Format 
Table A.45 nd8 Format 


Instruction Code Operation State T Bit 


MOV.W @(disp,PC),Rn 1001nnnndddddddd (disp x2 +PC) — sign 1 — 
extension — Rn 


MOV.L @(disp,PC),Rn 110innnndddddddd_ = (disp x 4+ PC) — Rn 1 — 


A.2.11 i Format 
Table A.46 Indirect Indexed GBR Addressing 


Instruction Code Operation State TBit 
AND.B #imm, @(RO,GBR) 11001101iiiiiiii (RO+ GBR) &imm > 3 — 
(RO + GBR) 
OR.B ‘ #imm, @(RO,GBR) 11001111iiiiiiii | (RO+GBR)limm—-> 3 — 
(RO + GBR) 
TST.B #imm, @(RO,GBR) 11001100iiiiiiii | (RO+ GBR) &imm, 3 Test 
| | when result is 0, 1 > T results 
XOR.B #imm, @(RO,GBR) 11001110iiiiiiii (RO+GBR)“imm> 3 = 
(RO + GBR) 
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Table A.47 Immediate Addressing (Arithmetic Logical Operation with Direct Register) 





Instruction Code Operation State TBit 

AND #imm, RO 11001001iiiiiiii RO&imm-— RO 1 — 

CMP/EQ #imm,RO  10001000iiiiiiii WhenRO=imm,1— 1. ~~ Comparison 

| | 3 | T results 
OR #imm,RO  11001011iiiiiiii ROlimm—RO 1 — 
TST | #imm,RO  11001000iiiiiiii | RO&imm,whenresult 1 Test results 
is0,1 —3T 
XOR #imm,RO 11001010iiiiiiii RO’ imm—RO 1 —_ 


Table A.48 Immediate Addressing (Specify Exception Processing Vector) 


Instruction Code Operation State T Bit 
TRAPA #imm 11000011iiiiiiii PC/SR — Stack area, (immx4+ 8 — 
| _VBR) — PC 


A.2.12 ni Format 
Table A.49 ni Format 


Instruction _ Code | Operation State TBit 
ADD #imm,Rn O1linnnniiiiiiii Rn + imm — Rn 1 — 
MOV #imm,Rn 1110nnnniiiiiiii imm — sign extension > Rn 1 — 





A.3 Instruction Set in Order by Instruction Code 


Table A.50 lists instruction codes and execution states in order by instruction code. 


Table A.50 Instruction Set by Instruction Code 





Instruction Code Operation State TBit 
CLRT 0000000000001000 0>T 1 0 
NOP 0000000000001001 No operation 1 — 
RTS 0000000000001011 Delayed branch,PR- 2 — 

! PC 
SETT 0000000000011000 1—3T 1 1 
DIVOU 0000000000011001 0 > M/Q/T 1 0 
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Table A.50 Instruction Set by Instruction Code (cont) 


Instruction Code Operation State T Bit 
SLEEP 0000000000011011 Sleep 3 — 
CLRMAC 0000000000101000 0 — MACH, MACL 1 — 
RTE 0000000000101011 Delayed branch, stack 4 LSB 
area — PC/SR 
STC SR, Rn 0000nnnn00000010 SR — Rn 1 — 
BSRF Rm*? 0000mmmm00000011 Delayed branch,PC > 2 — 
PR, Rm + PC > PC 
STS MACH, Rn 0000nnnn00001010 MACH — Rn 1 — 
STC GBR, Rn 0000nnnn00010010 GBR —> Rn 1 — 
STS MACL, Rn 0000nnnn00011010 MACL — Rn 1 — 
STC VBR, Rn 0000nnnn00100010 VBR — Rn 1 — 
BRAF Rm* 0000mmmm00100011 Delayed branch,Rm+ 2 — 
PC —+ PC 
MOVT Rn 0000nnnn00101001 T— Rn 1 — 
STS PR, Rn 0000nnnn00101010 PR — Rn 1 — 
MOV.B Rm, @(RO,Rn) 0000nnnnmmmm0100 Rm — (RO + Rn) 1 — 
MOV.W Rm, @(RO,Rn) 0000nnnnmmmm0101 Rm — (RO + Rn) 1 — 
MOV.L Rm, @(RO,Rn) 0000nnnnmmmm0110 Rm — (RO + Rn) 1 — 
MUL.L  Rm,Rn*? 0000nnnnmmmm0111 Rn x Rm —- MACL 2 — 
(to 4)*! 
MOV.B @(RO,Rm),Rn 0000nnnnmmmm1100 (RO + Rm) - sign 1 — 
extension > Rn 
MOV.W  @(RO,Rm),Rn 0000nnnnmmmm1101 (RO + Rm) > sign 1 — 
extension > Rn 
MOV.L @(RO,Rm),Rn 0000nnnnmmmm1110 (RO + Rm) — Rn 1 — 
MAC.L @Rmt+, @Rn+*? 0000nnnnmmmm1111 Signed, (Rn) x (Rm) + 3/ (2 — 
MAC — MAC to 4)*! 
MOV.L Rm,@(disp,Rn) 0001nnnnmmmmdddd Rm — (disp x4+Rn) 1 — 
MOV.B Rm, @Rn 0010nnnnmmmm0000 Rm — (Rn) 1 — 
MOV.W Rm, @Rn 0010nnnnmmmm0001 Rm — (Rn) 1 — 


Notes: 1. The normal minimum number of execution states (The number in parentheses is the 
number of states when there is contention with preceding/following instructions) 


2. SH-2 CPU instruction 
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Table A.50 Instruction Set by Instruction Code (cont). 


Code 
0010nnnnmmmm0010 
0010nnnnmmmm0100 


0010nnnnmmmm0101 
0010nnnnmmmm0110 


0010nnnnmmmm0111 


0010nnnnmmmm1000 


0010nnnnmmmm1001 
0010nnnnmmmm1010 


0010nnnnmmmm1100 


Operation 
Rm — (Rn) 


Rn-1— Rn, Rm > 
(Rn) 


Rn —-2 —- Rn, Rm > 
(Rn) 


Rn-4-— Rn, Rm > 
(Rn) 


MSB of Rn > Q, MSB 
of Rm >M,M4Q—-> 
T 


Rn & Rm, when result 
is0,1 3 T 


Rn & Rm > Rn 
Rn “Rm — Rn 
Rn! Rm — Rn 


When a byte in Rn 
equals a byte in Rm, 1 
oT 


State 


ee ee 


T Bit 


Calculation — 
result 


Test results 


0010nnnnmmmm1011 


Comparison 
result 


Center 32 bits of Rm 


0010nnnnmmmm1101 


0010nnnnmmmm1110 


and Rn > Rn 


1 to 3*! 


Unsigned, Rn x Rm > RP 


0010nnnnmmmm1111 


0011nnnnmmmm0000 


0011nnnnmmmm0010 


0011nnnnmmmm0011 


0011nnnnmmmm0100 


MAC 


Signed, Rn x Rm > 
MAC 


When Rn = Rm, 1 > T 


When unsigned and 
Rn = Rm, 1 3 T 


When signed and Rn 2 
Rm, 13 T 


1 to 3*! 


Comparison 
result 


Comparison 
result 


Comparison 
result 


Calculation 


1-step division (Rn + 


instruction 

MOV.L Rm, @Rn 
MOV.B Rm, @-Rn 
MOV.L Rm, @~Rn 
DIVOS Rm, Rn 
TST Rm, Rn 
AND Rm, Rn 
XOR Rm, Rn 
OR Rm, Rn 
CMP/STR Rm,Rn 
XTRCT Rm, Rn 
MULU.W  Rm,Rn 
MULS .W Rm, Rn 
CMP/EQ kRm,Rn 
CMP/HS  Rm,Rn 
CMP/GE Rm,Rn 
DIV1 Rm, Rn 
DMULU.L Rm, Rn*? 
Notes: 


0011nnnnmmmm0101 


Rm) 


Unsigned, Rn x Rm > 
MACH, MACL 


1. The normal minimum number of execution states 
2. SH-2 CPU instruction 
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2 to 4*1 | 


result 


Table A.50 Instruction Set by Instruction Code (cont) 


Instruction 


CMP /HI Rm, Rn 
CMP/GT Rm, Rn 
SUB Rm, Rn 
SUBC Rm, Rn 
SUBV Rm, Rn 
ADD Rm, Rn 
DMULS.L Rm,Rn*? 
ADDC Rm, Rn 
ADDV Rm, Rn 
SHLL Rn 

SHLR Rn 
STS.L - MACH, @—-Rn 
STC.L SR, @-Rn 
ROTL Rn 

ROTR Rn 
LDS.L @Rm+, MACH 
LDC .L @Rm+,SR 
SHLL2 Rn 
SHLR2 Rn 

LDS Rm, MACH 
Notes: 


Code 


0011nnnnmmmm0110 


0011nnnnmmmm0111 


0011nnnnmmmm1000 
0011nnnnmmmm1010 


0011nnnnmmmm1011 


0011nnnnmmmm1100 
0011nnnnmmmm1101 


0011nnnnmmmm1110 


0011nnnnmmmml1111 


0100nnnn00000000 
0100nnnn00000001 
0100nnnn00000010 


0100nnnn00000011 


0100nnnn00000100 
0100nnnn00000101 
0100mmmm00000110 


0100mmmm00000111 


0100nnnn00001000 
0100nnnn00001001 
0100mmmm00001010 


2/1 


Operation 


When unsigned 
and Rn > Rm, 1 
oT 


When signed and 
Rn>Rm, 1 > T 


Rn—-Rm- Rn 


Rn-Rm-T-> 
Rn, borrow > T 


Rn — Rm —> Rn, 
underflow > T 


Rm + Rn > Rn 


Signed, Rn x Rm 
— MACH, MACL 


Rn +Rm+T-> 
Rn, carry > T 


Rn + Rm — Rn, 
overflow > T 


T<eRn<O 
0 Rn->T 


Rn—-4— Rn, 
MACH -—> (Rn) 


Rn—-4-— Rn, SR 
— (Rn) 


T — Rn — MSB 
LSB — Rn > T 


(Rm) — MACH, 
Rm+4— Rm 


(Rm) > SR, Rm 
+4—>Rm 


Rn<<2 — Rn 
Rn>>2 > Rn 
Rm — MACH 


1. The normal minimum number of execution states 
2. SH-2 CPU instruction 


State 


— 


2 to 4*1 


wah, 


ee ee ee 


—h_ | aonb 


T Bit 


Comparison 
result 


Comparison 
result 


Borrow 


Underflow 


Carry 
Overflow 


MSB 
LSB 


MSB 
LSB 


LSB 


Table A.50_ Instruction Set by Instruction Code (cont) 


Instruction 

JSR @Rm 

LDC Rm,SR 
DT Rn*2 
CMP/PZ Rn 

STS.L MACL, @—Rn 
STC.L GBR, @-Rn 
CMP/PL Rn 

LDS.L @Rm+,MACL 
LDC.L  @Rm+,GBR 
SHLL8 Rn 

SHLR8 Rn 

LDS Rm, MACL 
TAS.B  @Rn 

LDC Rm,GBR 
SHAL Rn 

SHAR Rn 

STS.L PR, @—Rn 
STC.L VBR, @-Rn 
ROTCL Rn 

ROTCR Rn 

LDS.L @Rm+,PR 
LDC.L  @Rm+,VBR 
Notes: 


Code 
0100mmmm00001011 


0100mmmm00001110 
0100nnnn00010000 
0100nnnn00010001 
0100nnnn00010010 
0100nnnn00010011 
pio mnOnol ole 
0100mmmm00010110 
0100mmmm00010111 


0100nnnn00011000 
0100nnnn00011001 
0100mmmm00011010 
0100nnnn00011011 


0100mmmm00011110 
0100nnnn00100000 
0100nnnn00100001 
0100nnnn00100010 


0100nnnn00100011 


0100nnnn00100100 
0100nnnn00100101 
0100mmmm00100110 


0100mmmm00100111 


2. SH-2 CPU instruction 


Operation 


Delayed branch, PC 
— PR, Rm > PC 


Rm > SR 


Rn - 1 — Rn; if Rnis 
0,1-—T, if Rnis 
nonzero, 0 > T 


Rn20,1—->T 


Rn — 4 — Rn, MACL 
— (Rn) 


Rn -—4— Rn, GBR > 
(Rn) 


Rn>0,1>T 


(Rm) — MACL, Rm + 
4— Rm 


(Rm) — GBR, Rm + 4 
— Rm 


Rn<<8 — Rn 
Rn>>8 > Rn 
Rm — MACL 


When (Rn) is 0, 1 > 
T, 1 — MSB of (Rn) 


Rm —+ GBR 
Te Rn<O 
MSB > Rn > T 


Rn —-4— Rn, PR > 
(Rn) 


Rn — 4 > Rn, VBR > 
(Rn) 


Te Rone 
To RnoT 


(Rm) > PR, Rm + 4 
— Rm 


(Rm) — VBR, Rm + 4 
— Rm 
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State 


or ee 


ee ee 


T Bit 


LSB 


Comparison 
result 


Comparison 
result 


Comparison 
result 


Test results 


MSB 
LSB 


MSB 
LSB 


Table A.50 Instruction Set by Instruction Code (cont) 


Instruction Code Operation State T Bit 
SHLL16 Rn 0100nnnn00101000 Rn<<16 — Rn 1 — 
SHLR16 Rn 0100nnnn00101001 Rn>>16 — Rn 1 — 
LDS Rm, PR 0100mmmm00101010 Rm — PR 1 — 
JMP @Rm 0100mmmm00101011 Delayed branch,Rm 2 — 
— PC 
LDC Rm, VBR 0100mmmm00101110 Rm — VBR 1 — 
MAC.W @Rm+, @Rn+ 0100nnnnmmmm1111 Signed, (Rn) x(Rm) 3/(2)*) — 
+ MAC — MAC 
MOV.L @(disp,Rm),Rn 0101nnnnmmmmdddd (disp + Rm) — Rn 1 — 
MOV.B  @Rm,Rn 011 0nnnnmmmm0000 (Rm) —> sign 1 — 
| extension — Rn 
MOV.W  @Rm,Rn 0110nnnnmmmm0001 (Rm) — sign 1 — 
extension — Rn 
MOV.L @Rm,Rn 0110nnnnmmmm0010 (Rm) > Rn 1 — 
MOV Rm, Rn 0110nnnnmmmm0011 Rm > Rn 1 — 
MOV.B @Rm+,Rn 0110nnnnmmmm0100 (Rm) — sign 1 — 
extension — Rn, Rm 
+1—Rm 
MOV.W  @Rm+,Rn 0110nnnnmmmm0101 (Rm) > sign 1 — 
extension > Rn, Rm 
+2—>Rm 
MOV.L @Rm+,Rn 011 0nnnnmmmm0110 (Rm) Rn,Rm+4 = 1 — 
— Rm 
NOT Rm, Rn 0110nnnnmmmm0111 ~Rm > Rn 1 — 
SWAP.B Rm,Rn 0110nnnnmmmm1000 Rm — Swap upper 1 — 
and lower halves of 
lower 2 bytes — Rn 
SWAP.W Rm,Rn 0110nnnnmmmm1001 Rm — Swap upper 1 — 
and lower word > 
Rn 
NEGC Rm, Rn 0110nnnnmmmm1010 O—Rm-T —- Rn, 1 Borrow 
borrow > T 
NEG Rm, Rn 0110nnnnmmmm1011 O0-Rm-— Rn 1 — 
Notes: 1 The normal minimum number of execution states (The number in parentheses is the 


number in contention with preceding/following instructions) 
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Table A.50 Instruction Set by Instruction Code (cont) 


Code 
0110nnnnmmmm1100 


0110nnnnmmmm1101 


0110nnnnmmmm1110 


0110nnnnmmmmi111 


10000000nnnndddd 
10000001nnnndddd 


10000100mmmmndddd 


10000101mmmmdddd 


10001001dddddddd 


10001101dddddddd 


10001011dddddddd 


10001111dddddddd 


1001nnnndddddddd 


1010dddddddddddd 


Instruction 

EXTU.B Rm,Rn 

EXTU.W Rm,Rn 

EXTS.B Rm,Rn 

EXTS.W Rm,Rn 

ADD #imm, Rn 
MOV.B RO, @(disp,Rn) 
MOV.W RO, @(disp, Rn) 
MOV.B @(disp,Rm),RO 
MOV.W @(disp,Rm),RO 
CMP/EQ #:imm,RO 

BT label 

BT/S label* 

BF label 

BEF/S label* 

MOV.W @(disp,PC),Rn 
BRA label 

Notes: 2. SH-2 CPU instruction 


3. One state when it does not branch 


274 


Operation 


Zero-extends Rm 
from byte — Rn 


Zero-extends Rm 
from word — Rn 


Sign-extends Rm 
from byte — Rn 


Sign-extends Rm 
from word — Rn 


Rn + imm — Rn 
RO — (disp + Rn) 


RO — (disp x2 + 
Rn) 


(disp + Rm) > sign 
extension — RO 


(disp x 2 + Rm) > 


sign extension — RO 


When RO = imm, 1 
+T 


When T = 1, disp x 
2 +PC > PC; 
When T = 0, nop. 


When T = 1, disp x 
2 +PC— PC; 
When T = 1, nop. 
When T = 0, disp x 
2 +PC— PC; 
When T = 0, nop 
When T = 0, disp x 
2+PC -— PC; 
When T = 1, nop 


(disp x 2 + PC) > 


sign extension — Rn 


Delayed branch, 
disp x2 +PC > 
PC 


State 
1 


3/1*8 


2/1*8 


3/1*8 


2/1*3 


1 


T Bit 


Comparison 
results 


Table A.50 Instruction Set by Instruction Code (cont) 


Instruction 


BSR 


MOV .B 


MOV .W 


MOV .L 


TRAPA 


MOV .B 


MOV .W 


MOV .L 


MOVA 
TST 


XOR 


OR 


TST.B 


AND .B 


XOR.B 


OR.B 


MOV .L 
MOV 


label 

RO, @(disp, GBR) 
RO, @(disp, GBR) 
RO, @( disp, GBR) 


#imm 


@(disp,GBR),RO 
@(disp,GBR),RO 
@(disp,GBR), RO 


@(disp,PC),RO 
#imm, RO 


#imm, RO 
#imm, RO 
#imm, RO 
#imm, @(RO,GBR) 
#imm, @(RO,GBR) 
#imm, @(RO,GBR) 


#imm, @(RO,GBR) 


@(disp,PC),Rn 
#imm, Rn 


Code 
1011dddddddddddd 


11000000dddddddd 
11000001dddddddd 


11000010dddddddd 


11000100dddddddd 


11000101dddddddd 


11000110dddddddd 


1100011 1dddddddd 
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Operation 
Delayed branch, PC 


— PR, disp x 2 + PC 


—+ PC 

RO — (disp + GBR) 
RO — (disp x2 + 
GBR) 

RO — (disp x4 + 
GBR) 

PC/SR — Stack 


area, (imm x 4 + 
VBR) — PC 


(disp + GBR) > sign 


extension — RO 
(disp x 2 + GBR) > 


sign extension — RO 


(disp x 4 + GBR) > 
RO 


disp x 4+ PC > RO 


RO & imm, when 
result is 0, 1 > T 


RO &imm — RO 
RO “imm — RO 

RO | imm — RO 
(RO + GBR) & imm, 


when result is 0, 1 > 


T 


(RO + GBR) & imm 
— (RO + GBR) 


(RO + GBR) *imm > 


(RO + GBR) 


(RO + GBR) | imm > 


(RO + GBR) 


(disp x 4 + PC) > Rn 


imm — sign 
extension — Rn 


State 


2 


io) 


T Bit 


Test results 


Test results 


A.4 Operation Code Map 


Table A.51 is an operation code map. 


~ Table A.51 Operation Code Map 


Instruction Code , : Fx:0001 


MSB LSB 


ERn iFx i 


"Ti 
>< 
So 
© 
o 
i] 


STC SR,Rn* |STC GBR,Rn | STC VBR, Rn 
ee oe 


MOV ..B MOV .W MOV.L MUL.L 
Rm, @(RO,Rn) Rm, @(RO, Rn) Rm, @( RO, Rn) Rm, Rn* 


CLRT SETT CLRMAC 


£0011 
!01MD 


:4000 





STS MACH,Rn |STS MACL,Rn | STS PR, Rn 


'11MDJ| Mov.B MOV. W MOV. L MAC. L 
@(RO,Rm),Rn @(RO,Rm),Rn @(RO,Rm),Rn @Rm+, @Rn+* 


69) 





idisp |MOV.L Rm, @(disp:4,Rn) 


{O0OMD] MoV.B Rm,@Rn |MOV.W Rm, @Rn |MOV.L Rm, @Rn 


MOV .B MOV .W MOV.L DIVOS Rm,Rn 
Rm, @-Rn Rm, @-Rn Rm, @-Rn 


ae i ae a oe 


CMP/STR XTRCT Rm,Rn MULU .WRm, Rn MULS .W Rm, Rn 
Rm, Rn 


po. Nees Naeem 


DIV1L Rm,Rn DMULU .L CMP/HI Rm, Rn CMP/GT Rm, Rn 
Rm, Rn* 


son avn | —*( stein [S06 Rav 

ADD Rm, Rn DMULS.L ~ pe Rm, Rn 
Rm, Rn* 

—— 

= 


:01MD 


'11MD 


!01MD 


=10MD 





:0000 





'0001 


2/6 


Table A.51 Operation Code Map (cont) 


Instruction Code [Fx:0000 Fx: 0001 | Fx: 0010 | Fx: 0014-1494 
MSB LSB MD:o1MD:40 


{Fx $0010 |sts.1 STS.L STS.L 
MACH, @—Rn MACL, @—Rn PR, @-Rn 
£0011 |stTc.1L STC.L STC.L 

SR, @-Rn GBR, @-Rn VBR, @—Rn 


:0100 ROTL Rn ROTCL Rn 


ROTR Rn CMP/PL Rn ROTCR Rn 

LDS .L LDS .L LDS .L 
@Rm+ , MACH @Rm+ , MACL @Rm+,PR 

LDC.L LDC.L LDC. L 
@Rm+,SR @Rm+, GBR @Rm+, VBR 


SHLL2 Rn SHLL8 Rn SHLL16 Rn 


SHLR2 Rn SHLR8 Rn SHLR16 Rn 
LDS Rm,MACH |LDS - Rm,MACL | LDS Rm, PR 


JSR @Rm TAS.B @Rn JMP @Rm 
LDC Rm,SR LDC Rm,GBR | LDC Rm, VBR 


MAC.W @Rmt+, @Rn+ 





£0110 
£0111 


£1000 


£1110 





MOV.L @(disp:4,Rm),Rn 


MOV.B Rm,Rn MOV.W @Rm,Rn |MOV.L @Rm,Rn 
MOV.B Rm+,Rn | MOV.W GRtt,Rn MW.L@Rmt+, Rn 


SWAP .B SWAP .W NEGC kRm,Rn 
Rm, Rn Rm, Rn 
EXTU.B Rm,Rn|EXTU.W Rm,Rn | EXTS.B’ Rm,Rn 


D #imm:8,Rn 





disp 

:0OMD 
:01MD 
:10MD 


MOV Rm, Rn 
NOT Rm, Rn 
NEG Rm, Rn 


EXTS.W Rm,Rn 





[Rm i11MD 
ERn imm 
[OOMD:Rn disp 








is 







MW.B- RO, MW.W- RO, 
@(disp:4,Rn) @(disp:4,Rn) 
MOV.B MOV .W 
@(disp:4, @(disp:4, 
Rm), RO Rm), RO 













1000 :01MD:Rm disp 














CMP /EQ BT label :8 BF label:8 


#imm:8,RO 
BT/S 
label: 8* 


2/7 


10MD: imm/disp 


BF/S 
label : 8* 


1000 #11MD: imm/disp 


Table A.51 Operation Code Map (cont) 


instruction Code Fx: 0000 Fx:0001 = J Fx:0010 











































1100 {00MD: imm/disp |Mov.B RO, MOV.W RO, MOV.L RO, TRAPA #imm:8 
@(disp:8, @(disp:8, @(disp:8, 
GBR) GBR) GBR) 














MOV .B MOV .W MOV.L MOVA 














@(disp:8, @(disp:8, @(disp:8, @(disp:8, 
GBR) , RO PC) ,RO 
TST AND - | XOR OR 
. #imm:8,RO #imm:8,RO #imm:8,RO #imm:8,RO 
TST .B AND .B XOR.B OR.B 
#imm:8, #imm:8, #imm:8, #imm:8, 





@(RO,GBR) @(RO,GBR) @(RO,GBR) @(RO,GBR) 


MOV.L @(disp:8,PC),RO 
MOV #imm:8,Rn | 


Note: SH-2 CPU instructions 
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Appendix B Pipeline Operation and Contention 


The SH-1 and SH-2 CPU is designed so that basic instructions are executed in one state. Two or 
more states are required for instructions when, for example, the branch destination address is 
changed by a branch instruction or when the number of states is increased by contention between 
MA and IF. Table B.1 gives the number of execution states and stages for different types of 
contention and their instructions. Instructions without contention and instructions that require 2 or 
more cycles even without contention are also shown. 


Instructions experience contention in the following ways: 
¢ Operations and transfers between registers are executed in one state with no contention. 
¢ No contention occurs, but the instruction still requires 2 or more cycles. 


¢ Contention occurs, increasing the number of execution states. Contention combinations are as 
follows: 


— MAcontends with IF 

— MA contends with IF and sometimes with memory loads as well 

— MA contends with IF and sometimes with the multiplier as well 

— MA contends with IF and sometimes with memory loads and sometimes with the multiplier 
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Table B.1 Instructions and Their Contention Patterns 


Contention 
None 


MA contends with IF 


MA contends with IF and 
sometimes with memory loads as 
well 


MA contends with IF and 
sometimes with the multiplier as 
well 


MA contends with IF and 
sometimes with memory loads 
and sometimes with the multiplier 


State Stage 
1 3 
2 3 
3/1*3 3 
3 3 
4 5 
8 9 
1 4 
2 4 
3 6 
4 6 
1 5 
3 5 
1 4 


1to3 6/7"! 
*2 


3/(2)** 7/8"! 


3/(2to 9 
4)*2 

2to4*2 Q 
1 5 


instruction 
Transfer between registers 


Operation between registers (except 
multiplication instruction) 


Logical operation between registers 
Shift instruction 

System control ALU instruction 
Unconditional branche 

Conditional branche 

SLEEP instruction 

RTE instruction 

TRAP instruction 


Memory store instruction and STS.L 
instruction (PR) 


STC.L instruction 
Memory logic operations 
TAS instruction 


Memory load instructions and LDS.L 
instruction (PR) 


LDC.L instruction 


Register to MAC transfer instruction, 
memory to MAC transfer instruction and 
MAC to memory transfer instruction 


Multiplication instruction 


Multiply/accumulate instruction 


Double-length multiply/accumulate 
instruction (SH-2 only) 


Double-length multiplication instruction 
(SH-2 only) 


MAC to register transfer instruction 


Notes: 1. With the SH-2 CPU, multiply/accumulate instructions are 7 stages and multiplication 
instructions are 6 stages, while with the SH-1 CPU, multiply/accumulate instructions are 
8 stages and multiplication instructions are 7 stages. 
2. The normal minimum number of execution states (The number in parentheses is the 
number in contention with preceding/following instructions). 


3. One stage when it does not branch. 
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18 Floor Kukje Center Building 
191, 2-Ka, Hanggang—Ro 
Yongsan-Ku, Seoul, Korea 
Tel: 796-3115, 3647 to 8 
Fax: 796-2145 


Hitachi Asia (Hong Kong) Ltd. 

Beijing Office 

Room 1412, Beijing Fortune Building, 
5 Dong San Huan, Bei-lu, 

Chaoyang District Beijing 

People’s Republic of China 

Tel: 501-4351-4 

Fax: 501-4350 
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